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OPERATIONISM AND THE CONCEPT OF PERCEPTION * 


WENDELL R. GARNER, HAROLD W. HAKE, AND CHARLES W. ERIKSEN 


The Johns Hopkins University 


The attitude of contemporary opera- 
tionists toward perceptual research has 
been well characterized recently by All- 
port. 
by stating “that a perception can be re- 
garded as nothing more nor less than a 
discriminatory response” (1, p. 53). In 
even simpler terms, the “reaction is the 
perception,” and thus the role of the re- 
searcher is simply to determine the con- 
ditions under which a discriminatory re- 
sponse is obtained. These conditions 
then define perception. Unfortunately, 
we have to agree that many psycholo- 
gists who consider themselves operation- 
ists do in fact accept this position to- 
ward perception. However, this posi- 
tion is not necessary from the tenets of 
operationism. In fact, we believe that 
this viewpoint is a perversion of the 
fundamentals of operationism as stated 
by its originators. 

The essence of the above position is 
that a concept of perception is not 
distinguishable from the operations on 
which it is based, and thus that percep- 
tion is indistinguishable from the re- 
sponses which indicate its existence and 
character. This idea springs from a 
restricted interpretation of Bridgman’s 
writings. For example, Bridgman states 


1 The preparation of this report was sup- 
ported in part under Contract NSori-166, Task 
Order 1, between the Office of Naval Research 
and The Johns Hopkins University. This is 
Report No. 166-I-201, under that contract. 


He has described their attitude 


that “The concept is synonymous with 
the corresponding set of operations” 
(4, p. 5). This widely quoted statement 
has been used by psychologists to justify 
their unwillingness to distinguish be- 
tween perceptions and responses, and to 
support their position that any set of 
responses leads to a concept about the 
properties of the perceptual system. 
However, to state that a concept is 
synonymous with a set of operations is 
not to state that any operation can pro- 
duce a concept. 

Furthermore, psychologists have ig- 
nored the fact that Bridgman is talking 
about a set of operations, not a single 
experimental operation. He later em- 
phasizes this distinction, stating that 


Operational definitions, in spite of their pre- 
cision, are in application without significance 
unless the situations to which they are ap- 
plied are sufficiently developed so that at least 
two methods are known of getting to the 
terminus. Definition of a phenomenon by the 
operations which produced it, taken naked 
and without further qualification, has an en- 
tirely specious precision, because it is a de- 
scription of a single isolated event (3, p. 248). 


Many operationists accept the sterile 
point of view described by Allport, and 
consider perception not to have any op- 
erationally determinable properties other 
than discrimination. It seems to us 
that the above quotations from Bridg- 
man do not require such a narrow point 
of view. It is true that if the only op- 
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eration allowed were the discriminating 
reaction, then it would be impossible to 
determine whether the perceptual proc- 
ess had any properties other than dis- 
crimination. But it is equally unjusti- 
fiable to state that perception is nothing 
more than the discriminating reaction, 
since other possibilities have not been 
excluded. Surely the perceptual process 
has more richness than simple discrimi- 
nation. 

It seems clear that many operationists 
who nominally subscribe to the narrow 
operational position also feel that per- 
ception is more than discrimination, 
since they do in fact ascribe other prop- 
erties to the perceptual process. It is 
equally unjustifiable to ascribe such ad- 
ditional properties, when they are as- 
cribed on the basis of single experiments 
whose designs are inadequate to de- 
termine the nature of these additional 
properties. It is our contention that 


additional properties of the perceptual 
system can and must be considered, but 
that operational experiments of a par- 


ticular type are necessary to determine 
the nature of these properties. 


PERCEPTION AS A CONCEPT 


We conceive of perception as an in- 
tervening process between stimuli and 
responses, as schematically illustrated in 
Fig. 1. We can directly observe only 
stimuli and responses and, therefore, 
perception can be known only as a con- 
cept whose properties are induced from 
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Fic. 1. A schematic of the perceptual prob- 
lem. The three systems operate in a causal 
relationship as indicated, although there may 
be interdependencies. Each system, however, 
may have independent properties. The prop- 
erties of the stimulus and response systems 
can be directly observed; those of the per- 
ceptual system must be inferred. 
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objectively determined relations between 
stimuli and responses. This statement 
does not in any sense imply that per- 
ception is identical to responses (or to 
stimuli). Indeed it is the purpose of 
this paper to indicate the kinds of op- 
erations which make it possible to dis- 
tinguish perception from responses, and 
to show that these operations are neces- 
sary if the concept of perception is to 
have any use or meaning. 

The discriminating reaction. We agree 
with contemporary operationists that the 
fundamental and prerequisite operation 
in any experiment on perception is to 
demonstrate a discrimination between 
stimuli on the basis of responses. In 
other words, it must be demonstrated 
that there is a contingency relationship 
between stimuli and responses. If such 
a relation is demonstrated, we know 
that a subject can use the same label 
for the same stimulus within a certain 
error tolerance. 

This operation alone, however, assures 
us only that we have a system which is 
operating and which is reliably assign- 
ing responses to the various stimuli. 
This operation provides us with so little 
information about perception that in- 
deed we cannot distinguish between per- 
ceptual and response processes. How- 
ever, if perception is considered a con- 
cept separate from the response system, 
there are rules for inductively determin- 
ing the properties of that concept, and 
they are the same as those for determin- 
ing the properties of any concept. 

Converging operations. The neces- 
sary condition which makes possible the 
determination of particular character- 
istics of any concept (including the con- 
cept of perception) is the use of what 
have been called converging operations 
(6). Converging operations may be 
thought of as any set of two or more ex- 
perimental operations which allow the 
selection or elimination of alternative 
hypotheses or concepts which could 
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explain an experimental result. They 
are called converging operations because 
they are not perfectly correlated and 
thus can converge on a single concept. 

To illustrate, let us assume an ex- 
periment similar to one reported by Mc- 
Ginnies (11) in which visual thresholds 
are determined for words with different 
emotional content. In our hypothetical 
experiment, we present four alternative 
stimulus words, tachistoscopically—fire, 
save, shit, fuck. The responses to these 
words are the verbal pronunciation of 
them. We find that the two vulgar 
words have higher thresholds than the 
two nonvulgar words. Such a result 
has usually been interpreted as indicat- 
ing that the perceptual system differ- 
entially discriminates on the basis of 
the emotional content of the perceived 
stimuli. There is, however, at least one 
other alternative explanation of these 
results—namely, that the difference in 
threshold is a result of a characteristic 
of the response system, which inhibits 
the verbalizing of some of these words. 
A converging operation which would dis- 
criminate between the two alternative 
hypotheses would be to present the 
same stimuli as before, but to pair these 
stimuli with responses such that vulgar 
responses are used for nonvulgar stimuli, 
and vice versa. This experiment in con- 
junction with that of the first would 
allow us to decide which of the two hy- 
potheses is correct, or if some combina- 
tion of them is. It should be noted that 
the two operations taken together pro- 
vide the convergence. One experiment 
does not converge on the other, but 
rather the two converge on a mutually 
acceptable result. 

Ideally, converging operations would 
be orthogonal (completely independent), 
since such operations are the most effi- 
cient. In practice, however, it is diffi- 
cult to obtain truly orthogonal opera- 
tions, because the world is so organized 
that all variables cannot be controlled 





151 


completely independently. This fact 
does not seriously change the nature of 
the problem, because a sufficient num- 
ber of partially converging operations 
can still provide precise delimitation of 
alternative concepts. 

Nevertheless, sets of operations can 
be considered more or less efficient in 
allowing the formation of definitive con- 
cepts. One class of operations can be 
excluded entirely from consideration. 
This class can be called parallel opera- 
tions. These operations select among 
alternative hypotheses along the same 
dimension, and thus cannot converge to 
a single concept. In our example above, 
we could do the entire experiment again 
using different words, but in which two 
are still vulgar and two nonvulgar. If 
this experiment produces the same re- 
sult as the first, we still do not know 
whether the difference in threshold is 
due to a characteristic of the perceptual 
system or of the response system. 

There are two special types of paral- 
lel operation which deserve mention. 
One of these is the repeat operation, 
in which the same experiment is re- 
peated at another time. Such an ex- 
periment does not allow convergence to 
a single concept unless the concept in- 
volves time as a variable. Another spe- 
cial type of parallel operation is the 
transform operation, in which one vari- 
able is simply a transformation of an- 
other. Again, in the illustration we 
have been using, suppose we retain the 
original set of words as the stimuli, but 
use four synonyms as responses—burn, 
keep, crap, screw. These synonyms 
used as responses can be considered 
transforms of the original words used 
as responses, and this experiment will 
not allow us to determine whether the 
differential effect exists in the perceptual 
system or in the response system. 

It should be obvious from the above 
discussion that the value of a set of 
converging operations depends less on 
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the nature of the operations themselves 
than on the alternative hypotheses or 
properties which are being considered. 
For example, we mentioned above the 
use of synonyms as an example of a 
transform operation. If, however, the 
synonyms were used as the stimuli, this 
experiment in conjunction with the first 
would converge to determine whether 
the differential thresholds were due to 
the letter configurations of the original 
words, rather than to their meanings. 
Thus these two experiments would be 
converging for that purpose, but would 
not be converging for purposes of deter- 
mining whether the effect was in the per- 
ceptual system or in the response sys- 
tem. 

Although a minimum of two converg- 
ing operations may define a concept, in 
practice it is rare that two are sufficient. 
In the first place, there are usually more 
alternative hypotheses than can be de- 
limited with two converging operations. 


In the second place, converging opera- 
tions are rarely orthogonal, but are usu- 


ally only partially converging. If there 
are enough partially converging opera- 
tions, we can still arrive at a single, 
well-defined concept, but more than two 
such operations will be necessary. 

It is quite legitimate to use assumed 
converging operations in place of opera- 
tions actually carried out. We know 
from previous research that many of 
the possible converging operations would 
work in a particular way if we actually 
tried them, and thus there is little point 
in trying them again. The fact that we 
use assumptions, however, should not 
allow us to lose sight of the fact that 
the validity of our concepts rests en- 
tirely on the validity of the operations, 
whether carried out or assumed. A con- 
cept has no meaning other than that de- 
rived from the operations on which it is 
based, and unless these operations are 
known, the concept cannot be known 
either. 


OPERATIONAL DISTINCTION BETWEEN 
RESPONSE AND PERCEPTION 


An important use of converging op- 
erations is to distinguish effects which 
exist in the response system from those 
which exist in the perceptual system. 
Since the primary function of percep- 
tual research is to determine something 
about the properties of the perceptual 
system, a major requirement in such re- 
search is to use converging operations 
which will eliminate the possibility that 
the outcome of an experiment is due to 
properties of the response system rather 
than of the perceptual system. 

One of the more important properties 
of the response system which can affect 
the outcome of a perceptual experiment 
(and possible interpretations about prop- 
erties of the perceptual system) is that 
of response differentiation. For exam- 
ple, if only one response is available to 
a subject, it is clearly impossible to 
demonstrate anything about perceptual 
discrimination. This principle is quite 
obvious when stated in such an extreme 
form, but it can also operate in other 
less obvious ways. For example, if the 
number of response categories is too 
small to demonstrate the perceptual dis- 
crimination capacity of a subject, then 
the outcome of the experiment will be 
limited by a property of the response 
system rather than by a property of the 
perceptual system. Thus, a converging 
operation which is required in many 
experiments is one which demonstrates 
that discrimination is invariant with re- 
spect to the number of response cate- 
gories and with respect to the discrimi- 
nability or differentiation of these cate- 
gories. Such converging operations must 
either be carried out or be assumed be- 
fore we can state anything about the 
limits of perceptual discrimination; if 
they are assumed, then the validity of 
the conclusion is limited by the validity 
of the assumption. 
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A second major property of the re- 
sponse system which can affect inter- 
pretation of experimental results is that 
of response availability. To some ex- 
tent this property can be considered as 
a special case of response differentia- 
tion. If a subject fails to use a par- 
ticular response which is defined as ap- 
propriate to a particular stimulus, we 
cannot state that there was a failure of 
the perceptual system. The failure of 
response can be due to sheer motor in- 
ability. It can also be due to such 
things as response inhibition (as in the 
case of vulgar words). That is to say, 
the response system can be affected by 
emotional factors just as well as the 
perceptual system can. 

There are other ways in which re- 
sponse availability can operate. For ex- 
ample, it is known that human subjects 
(and rats too) have a preference for 
some responses over others. These pref- 
erences can affect the apparent nature 
of the relation between stimuli and re- 
sponses, and, unless they are taken into 
account, can lead to misinterpretations 
about properties of the perceptual sys- 
tem. Subjects also exhibit sequential ef- 
fects in their responses and, if preferred 
sequences of responses conflict with the 
sequences of stimuli presented, misinter- 
pretations can again occur. This factor 
is commonly taken into consideration in 
psychophysical experiments, where truly 
random sequences of stimuli are rarely 
used. Rather, modified random se- 
quences are used which prevent long 
runs, since most subjects do not believe 
that long runs can occur by chance and 
thus are unwilling to use them. 

In summary, then, there are many 
ways in which properties of the response 
system can affect the outcome of a per- 
ceptual experiment and, unless converg- 
ing operations are used to delimit an 
effect specifically to the perceptual sys- 
tem, properties can be incorrectly as- 
cribed to that system. 
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CONVERGING OPERATIONS AND SOME 
CURRENT PROBLEMS IN PERCEPTION 


To illustrate more specifically the im- 
plications of this discussion for research 
on perception, we shall discuss three 
types of experiments in these terms, 
pointing out the kinds of converging op- 
erations which are necessary before the 
results of experiments can reasonably be 
ascribed to particular properties of the 
perceptual system. 

Subception. The concept of subcep- 
tion has been introduced by Lazarus 
and McCleary (10) to explain the re- 
sults of a certain class of experiments 
on perception. Typically, in these ex- 
periments, subjects make two different 
but simultaneous responses to stimuli 
from a set presented one at a time. 
Usually some of the stimuli have a 
meaning (frequently anxiety-producing) 
which is different from that of other 
stimuli in the set. This different mean- 
ing may be inherent in the words, or 
may be induced by conditioning pro- 
cedures. One response is overt, usually 
verbal. The other response is often 
physiological, nonverbalizable, and pre- 
sumably indicative of emotion. The 
GSR has been most frequently used. 
The result which subception purports to 
explain is that the nonverbalized re- 
sponse shows discrimination when the 
verbalized response does not. To ex- 
plain this result it is assumed that per- 
ceptual discrimination occurs subcon- 
sciously: thus the term subception, 
which ascribes a property to the per- 
ceptual system. . 

Underlying this definition of subcep- 
tion is the equation of consciousness 
with verbal report, a distinction that we 
are reluctant to accept without more ex- 
plicit specification of the converging op- 
erations on which it rests. But even 
assuming that these converging opera- 
tions can be specified, the inference of 
unconscious perceptual discrimination 
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from the typical subception experiment 
is not justified. It is not justified be- 
cause the converging operations are lack- 
ing that would permit us to ascribe the 
results to properties of the perceptual 
system as opposed to properties of the 
response system. The fact that the 
GSR is a nonverbally reportable re- 
sponse, is not sufficient cause for as- 
suming that the perceptual discrimina- 
tion is also not verbally reportable. 

Eriksen (5) has pointed out that the 
subception result can be looked at basi- 
cally as the demonstration of a partial 
correlation between the stimulus and 
the GSR, with the verbal response held 
constant. When the subception experi- 
ment is conceived this way, many prop- 
erties of the response system which could 
produce the subception result become 
more obvious. For example, Eriksen 
has noted that such factors as lack of 
verbal response differentiation, asyn- 
chronous response oscillation, and dif- 
ferential response strengths will satisfy 
the requirements for the partial corre- 
lation. 

When the problem is looked at this 
way it becomes obvious that a demon- 
strated partial correlation as indicated 
above does not allow us to infer the 
property of subconscious discrimination 
to the perceptual system. 

Rather, in order to demonstrate that 
there is perceptual discrimination which 
cannot be verbally described, it is neces- 
sary to show that the first-order corre- 
lation of stimuli with nonverbal re- 
sponses is greater than that between 
stimuli and verbal responses. But even 
if a single experiment gave this result, 
it would still be necessary to use con- 
verging operations which demonstrate 
that the correlation between stimuli and 
verbal response was not limited by the 
size or degree of differentiation of the 
response set. Bricker and Chapanis (2) 
have shown that more precise discrimi- 
nation of stimuli can be accomplished 


verbally if more alternative responses 
are allowed. If no verbal response sys- 
tem can be found which gives as good 
a correlation with stimuli as the GSR, 
then and only then can we conclude that 
additional perceptual discrimination is 
operating at a nonverbal level. 

Perceptual set. Many experiments 
have demonstrated that the nature of 
the relation between stimuli and re- 
sponses changes when the response sys- 
tem changes. A common interpretation 
of such experiments is that the set of 
responses provided the subject serves to 
produce a perceptually selective set. In 
many of these experiments, however, the 
necessary converging operations have not 
been undertaken to justify delimitation 
of the effect to the perceptual system. 

Suppose an experiment similar to one 
by Hyman and Hake (9), in which sub- 
jects are required to identify the form 
of stimuli presented tachistoscopically. 
In one condition subjects are told prior 
to the stimulus exposure that the form 
will be one of two particular alterna- 
tives. In another condition they are 
told that the form will be one of four. 
When the duration of exposure required 
for identification of the form is shorter 
with just two stimuli, a possible conclu- 
sion is that subjects adopt a more ac- 
curate perceptual set with the smaller 
number of stimuli. 

There is, however, an alternative ex- 
planation which requires no assump- 
tions about the accuracy of the percep- 
tual process, namely, that the amount 
of error in the response system de- 
creases with a decrease in number of 
response categories. One necessary con- 
verging operation, then, is to run an- 
other experiment in which four stimuli 
are presented, but only two response 
categories are allowed. If the accuracy 
of identification in this case is identi- 
cal to the two stimulus-two response 
case, then we must assume that the ef- 
fect is due entirely to a characteristic 
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of the response system. To complete 
the set of converging operations, it 
would be necessary also to present two 
stimuli but to allow four response cate- 
gories. If the responses, however, are 
direct descriptions of the stimuli, this 
last operation would be meaningless to 
carry out. 

Actually, there is still another set of 
converging operations necessary if two 
responses are used with four stimuli. 
If the nature of the responses and 
stimuli are such that the subject can 
observe one of two aspects of the stimu- 
lus, then this operation is parallel to 
that of using two stimuli and two re- 
sponses. For example, suppose that the 
four stimuli were red square, blue 
square, red circle, blue circle; and the 
two responses were red, blue. For all 
practical purposes, there are just two 
stimuli, not four, and this operation 
would not allow us to determine where 
the effect exists. If the responses were 
abstract, such as letters of the alphabet, 
and were not assigned to the stimuli by 
just color or form, the operation would 
be converging. One system for handling 
this problem is to use the two responses 
with all possible pairs of the four stimuli 
to determine the extent to which mean- 
ingful grouping of the stimuli affects 
discrimination. 

Sensory scales. One area of percep- 
tual research where these principles of 
converging operations are most difficult 
to apply, and have been applied least, 
is that of sensory magnitude scales. Psy- 
chological scales have been produced 
for pitch, loudness, weight, brightness, 
length, and many other perceptual at- 
tributes. In most cases it is assumed 
that the resultant scale tells us about a 
property of the perceptual system, but 
we believe that very few of the experi- 
ments have included the necessary con- 
verging operations to ensure that the 
final function truly describes a percep- 
tual process. 
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To illustrate, let us suppose an ex- 
periment similar to one reported by 
Reese et al. (12). We provide a large 
circular, vertical display on which ap- 
pears a single line radiating from the 
center. This line is set to various an- 
gular positions, and subjects are asked 
to estimate the angle, using as responses 
the whole numbers from 0 to 360. 
When we have obtained many estimates, 
and averaged them for given physical 
settings, we plot the estimated angle 
against the presented angle. At first 
glance it would appear that we could 
call such a function a_ psychological 
scale of inclination having interval prop- 
erties. We have no converging opera- 
tions, however, to indicate that the sub- 
jects actually used the numerical re- 
sponse system in such a way as to 
reflect the perceived magnitudes of an- 
gles. People have had considerable ex- 
perience with such estimates, particu- 
larly with clocks, and in the course of 
this experience they have learned to 
identify certain perceptions with cer- 
tain numbers. The fact that later they 
can do so in an experiment does not 
alter the fact that they are using the 
numbers simply as learned identifying 
responses. Since the subjects have had 
experience with the 0 to 12 scale but 
not with the 0 to 360 scale, it might be 
argued that the new number system 
would actually reflect properties of the 
perceptual system. However, as we 
pointed out above, one operation which 
is a transform of another does not 
satisfy the requirements of converging 
operations. 

There is an operation which would ap- 
pear to satisfy the requirements of con- 
vergence for the property under consid- 
eration—an interval property of the per- 
ceptual system. This operation would 
be to have the subjects set series of 
three radiating lines in such a way that 
two equal angular intervals are pro- 
vided. In this case a number system is 
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not used, and thus the chance that the 
numbers are used simply as identifying 
labels would seem to be minimized. 
Subjects are so accustomed to using 
numbers in this situation, however, that 
they may very well verbalize the num- 
bers to themselves, and thus once again 
we have a transform rather than a con- 
verging operation. 

Actually, in situations where so much 
learning of objectively assignable num- 
bers to stimulus attributes has occurred, 
it is doubtful that we can ever suffi- 
ciently satisfy the requirements of con- 
verging operations to be sure that we 
are measuring a metric property of the 
perceptual system, rather than of the 
objective stimulus system. It would be 
necessary to get subjects who have had 
no previous experience in assigning num- 
bers to the continuum under considera- 
tion. 

Gibson and Bergman (8) have clearly 
recognized the nature of this problem in 


their experiments on estimation of dis- 


tance. Consistent underestimations of 
distance can be corrected by one or two 
correction trials (the correction general- 
izing to all distances used in the experi- 
ment), and the authors feel that the 
correction has simply changed the sub- 
jects’ conceptual scale of yards. In 
other words, they are assuming that the 
change was not in the perceptual sys- 
tem, but rather in the response system. 
Given a situation like this, we can make 
statements about the conceptual scale 
of numbers which the subject has, and 
its relation to physically measured dis- 
tance, but we cannot know anything 
about the metric properties of the per- 
ceptual system. 

There are other types of psychological 
dimension that make the problems more 
obscure because there is no commonly 
accepted number system which the usual 
subject has been trained to use. Of 
these, the one dimension that has re- 
ceived the greatest attention is loud- 


ness. Presumably, subjects have had 
little or no training in using a number 
scale to describe the loudness of a tone or 
noise. So when a subject is asked to as- 
sign numbers to the loudnesses of vari- 
ous tones, he should be able to respond 
only on the basis of the perceived mag- 
nitudes. This position has been taken 
by several psychologists, particularly 
Stevens (13). However, when subjects 
are required to use a set of responses 
which constitute a numerical scale, we 
have no assurance that they in fact use 
the response continuum in such a way 
as to reflect the ratio or interval prop- 
erties of the perceived loudness cor- 
rectly. The question is not whether 
they can use the number scale correctly, 
when using it as an abstract scale, but 
rather whether they use it in such a 
way as to reflect a metric property of 
the perceptual system. 

Suppose we have done an experiment 
in which direct numerical estimates of 
the loudnesses of tones, or of the rela- 
tive loudnesses of two different tones, 
are made. From these estimates we can 
construct a functional relation which we 
can tentatively call a psychological scale 
of loudness. Having no assurance that 
the number scale was used to reflect the 
metric properties of the perceptual sys- 
tem—the numbers could have been used 
as simple identifying responses, or per- 
haps as ordered responses—we need to 
carry out one or more operations which 
converge to allow us to delimit a prop- 
erty of the perceptual system. We 
could ask our subjects to produce tones 
which sound half, or a quarter, as loud 
as other tones which the experimenter 
provides. Such an operation does not 
converge with the former, however, since 
the number system whose use is in doubt 
is the same in both instances. 

A considerably better converging op- 
eration is one that does not require the 
use of numbers at all. If a fractiona- 
tion or direct estimation has been car- 





OPERATIONISM AND PERCEPTION 


ried out, and if the resultant function is 
truly a loudness scale with the proper- 
ties of a ratio scale, then we should be 
able to determine a set of equal interval 
points on our scale. Now if we ask our 
subject to set a series of tones at in- 
tensities which provide a series of equal 
intervals of loudnesses, these equal in- 
tervals should check with those obtained 
from the fractionation experiments. In 
the one instance in which such a com- 
parison has been made, Garner (7) 
showed that the results of the two ex- 
periments do not check; the two experi- 
mental operations did not converge to 
a single concept about the metric prop- 
erties of the perceptual system. Garner 
also showed that if it is assumed only 
that the fractionation experiments pro- 
vided equal ratios—not necessarily with 
the stated magnitude—the two experi- 
ments can converge to a single loudness 
scale. This scale is quite different from 
that obtained with numerical estimates, 
however, and its properties and uses are 
different from those assumed for “scaies” 
obtained from direct estimates. 

The major point of this section is not 
to prove that psychological magnitude 
scales are impossible to construct but 
simply to point out that before we can 
ascribe a property to the perceptual sys- 
tem as refined as that of a ratio scale, 
certain minimal converging operations 
are required—operations that in prac- 
tice have rarely been carried out. Fur- 
thermore, it is not sufficient to carry out 
the same experiment twice, or to carry 
out a slightly modified experiment which 
provides only transform operations. The 
operations must truly be converging for 
the pertinent assumed property of the 
perceptual system. Until these opera- 
tions are carried out, we can have 
only functional relations, not magnitude 
scales.” 


2It should be pointed out that there are 
other scaling techniques for which the ascribed 
properties of the perceptual system are not at 


CoNCLUSION 


Our emphasis on the use of carefully 
planned operations in order to isolate 
properties of perceptual behavior should 
not be taken to mean that operation- 
ism properly used tends to minimize the 
importance of perception. We believe 
simply that its proper use rigorously de- 
fines those properties which validly can 
be assigned to perception. 

This argument holds for all aspects 
of behavior which can be described as 
perceptual. For example, the compari- 
son usually made betwen the rich, al- 
though unreliable, data provided by in- 
trospection and the sterile, although 
rigorous, data provided by operationism 
is invalid for two reasons: first, because 
the basic position taken by many op- 
erationists does not indicate the real 
possibilities inherent in this position; 
second, and more important, because 
the data produced by introspection can 
have no meaning independent of the 
operations used to produce them. 

For example, two aspects of the in- 
trospective method have been claimed 
to have special advantages. First, it is 
claimed that a much more detailed de- 
scription of experience can be reported 
by a subject using unrestricted re- 
sponses. This problem is not unique 
with introspection. Operationism does 
not require that responses be restricted 
in any way; it only points out that with 
unrestricted response sets, it is difficult 
or impossible to separate properties of 
the response system from those of the 
perceptual system. 

A second claimed advantage of intro- 
spection is that subjects can be in- 
structed to confine their attention to 


variance with the converging operations car- 
ried out. The discriminative scaling tech- 
niques, particularly as developed by Thur- 
stone, satisfy these requirements as long as no 
attempt is made to assume that the resultant 
scales reflect magnitude of perceptions rather 
than discriminability of perceptions. 
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facts of subjective experience, i.e., the 
stimulus error can be avoided. There 
is nothing about operationism which ex- 
cludes this possibility, either. The ef- 
fects of the subject’s set produced by 
instruction and training are important 
experimental problems in perception. 
We insist, though, that if these effects 
are in the perceptual system, then op- 
erations can be devised to demonstrate 
this fact. These operations must dem- 
onstrate that the effect of instructions 
is not merely to change the kinds of re- 
sponses that subjects use, nor simply to 
decrease the correlation between stimuli 
and responses by causing subjects to 
respond to factors other than stimuli. 
That is to say, the methods of intro- 
spection are operations, and as such 
cannot lead to valid concepts that are 
independent of these operations. Intro- 
spection and the data produced by it do 
not lie outside the scope of operationism 
as here conceived. 

We also feel that there is no need to 
minimize experimental questions about 
“awareness.” For humans, at least, 
awareness is undoubtedly one aspect of 
perception. We insist, however, that 
the awareness of a subject can be no 
more than a property of his perceptual 
system, and that it can be specified 
only in terms of a set of converging op- 
erations. 

In summary, we believe that our po- 
sition is truly one of operationism. We 
believe that a concept has no meaning 
beyond that obtained from the opera- 
tions on which it is based. This state- 
ment does not mean that any set of op- 
erations can lead to a concept. Nor 
does it mean that the complexity and 
usefulness of concepts derived from op- 
erations are necessarily limited in any 
way. In practice they are limited only 
by the ingenuity of the individual ex- 
perimenter to devise appropriate con- 
verging operations. 


SUMMARY 


Perception is conceived as a process 
intervening between stimuli and re- 
sponses. As such it can be viewed as a 
concept whose properties may be de- 
limited by converging operations. Con- 
verging operations are any set of experi- 
mental operations which eliminate al- 
ternative hypotheses and which can lead 
to a concept which is not uniquely 
identified with any one of the original 
operations, but is defined by the results 
of all operations performed. Thus con- 
verging operations can lead to concepts 
of processes which are not directly ob- 
servable. For example, converging op- 
erations can be used to describe proper- 
ties of the perceptual process which are 
distinct from those of the response sys- 
tem directly observed. 

Illustrations from current experimen- 
tal problems in perception indicate how 
some response characteristics may be 
isolated from perceptual properties, and 
vice versa. Some of these properties 
have been ascribed to perception with- 
out supporting converging operations by 
researchers dissatisfied by the sterility 
of operationism as it is commonly, but 
mistakenly, conceived. 
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The proposition that perception is in- 
fluenced by object value and perceiver 
need has enjoyed an exciting career 
since it was given prominence by Bruner 
and Goodman in an oral and, later, a 
printed report (3) of dramatic differ- 
ences between rich and poor children in 
their judgments of coin sizes. Whether 
that study or subsequent ones can be 
said to have upheld the proposition may 
be questionable; but the effect on psy- 
chologists is beyond doubt. They were 
refreshed and stimulated. The ideas, 
the experimental approach, and the re- 
sults were new, and quite obviously ap- 
pealed to current needs. 

It was with a little surprise that, 
perhaps two years ago, while ruffling 
through the time-blurred pages of the 
Psychological Review of 1906, I ran 
across an article which seemed to bear 
some kinship to contemporary studies 
but which had not, as far as I was 
aware, been referred to in the recent 
literature. That accident started a train 
of thought which, with the aid of a few 
other bits of historical material, has led 
to the present paper. Very briefly, what 
I have to say is this. First, there are 
certain points of agreement between sev- 
eral coin studies, old and new, but the 
agreement is not such as to encourage 
the Bruner-Goodman theory that per- 
ception itself is distorted by value and 
need, at least directly. Second, there 
may be some advantage in applying the 
concept of schemata to the coin per- 
ception problem, and to other percep- 
tual problems as well. The concept, 
which goes back to Head and Bartlett, 


has been very recently discussed in this 
Journal by Vernon (18). 


Otp LIGHT on A NEw ARGUMENT 


The 1906 paper alluded to above was 
by Robert MacDougall (11). It tended 
to show that the symbolic difference 
between two different denominations of 
paper money ($1 and $10) ‘is reflected 
in judgments of their weight. “How 
many bills together equal the weight of 
a silver dollar?” was the question Mac- 
Dougall put to his schoolgirl subjects. 
To avoid suggestion effects on the main 
comparison he wished to make, he speci- 
fied only the one denomination or the 
other when addressing his query to any 
particular group. In either case, the 
correct answer is about 20. The guesses 
obtained ranged from 5 to a wild 10,000 
or so. Not overly concerned about the 
range, MacDougall thought that the av- 
erages and medians favored the view 
that monetary value has a positive ef- 
fect on estimates of physical quantity. 
Some of his data are presented in 
Table 1. 


TABLE 1 


EsTIMATES OF NUMBER OF BiILLs NEEDED TO 
EguaL WEIGHT oF SILVER DOLLAR, 
MacDouGa_t (10) 








$1 





Avg 





98 
99 
84 
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MacDougall’s inquiry does not stand 
alone at that period. It was designed, 
in fact, to follow up a note by J. F. 
Messenger in Science in 1902 (12) which 
had elicited at least two other brief 
communications in the same journal, 
one by Pierce in 1902 (16) and one by 
Slosson in 1903 (17). All three of these 
writers offer statistical proof that the 
weight of paper money is underesti- 
mated when compared in memory with 
coins of gold or silver—a result which 
may or may not agree with value theo- 
ries of perception, depending upon how 
value is defined. But there should cer- 
tainly be a hearty welcome from that 
quarter for Messenger’s finding that 
university students and faculty mem- 
bers considered $1 bills lighter than 
those of higher denomination. 

Half a century ago, then, the minds 
of American psychologists were busy 
with a question which, in slightly al- 
tered form, has concerned us during the 
last decade. The technique of these 
early studies may be criticized, and it 
may be doubted whether the numerical 
guesses produced by the subjects de- 
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serve to be taken literally; but there is 
a relevance to present-day research on 
perception of valued objects which 
should not be ignored simply because 
the question then was cast in terms of 
memory judgments unaided by sensory 
inspection. The problem was the same. 

In 1913 G. C. Myers published a 
monograph on incidental memory (13) 
which is directly pertinent to recent de- 
velopments. Among other things he in- 
vestigated the accuracy with which peo- 
ple estimate the sizes of familiar coins. 
He presented his numerous subjects 
with a sheet of cardboard bearing com- 
pass-drawn circles ranging in diameter 
from 9 to 44 millimeters by 1-millimeter 
steps, and asked them to select from 
these the six circles corresponding most 
nearly in size to penny, nickel, dime, 
quarter, half-dollar, and dollar.* In 


1 He considered the circles with diameters 
of 19, 21, 18, 24, 31, and 38 mm. as fairly 
representing the sizes of the penny, nickel, 
dime, quarter, half-dollar, and dollar, respec- 
tively, the precise diameters of which he gives 
on the authority of the U. S. Treasury as 
19.05, 21.2, 17.9, 24.25, 30.6, and 38.1 mm. 


TABLE 2 


EsTIMATES OF DIAMETERS OF Corns (IN MILLIMETERS) 
BASED ON MEMORY AND VISUAL INSPECTION 








Nickel 
(21.2) 


Cent 
(19.0) 


Quarter 
(24.1) 


Dime 
(17.8) 


Half Dollar 
(30.5) 


Standards 





Memory 





Myers (N =58-59) 15 

Bruner and Goodman Size = Area? 18.8 
(N =20) Size = Diam. ? 19.9 

Carter and Schooler (N =48) 16.6 





nspection 





34.6 
39.5 


23.6 28.2 
26.3 33.0 
22.7 28 

21.1 25.6 
21.1 24.4 


20.2 
23.0 


20.4 
22.1 
19.3 
16.4 18.1 
16.6 18 


Bruner and Goodman Size = Area? 
(N =20) Size = Diam. ? 

Bruner and Rodrigues (N =30) 

Carter and Schooler (N =48) 

Carter and Schooler Cardboard Discs* 
(N =48) if 


r 


* The diameters of the discs were 17.5, 18.7, 21.2, 24.1, 30.6, thus differing slightly from the diameters of the coins, 


33.5 
31.9 
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Table 2, alongside some modern data, I 
have entered his results for fifth-grade 
boys and girls, converting his error fig- 
ures by a simple calculation into aver- 
age estimated diameters. The agree- 
ment of Myers with Carter and Schooler 
(7) is very close. Though the corre- 
spondence with Bruner and Goodman is 
less close, it is still pretty good, pro- 
vided that we assume that their “size” 
equals area rather than diameter—a 
problem of interpretation to be dis- 
cussed below. 

The features of the data in the first 
half of Table 2 most deserving our at- 
tention are brought out in Fig. 1. The 
memory estimates from all three studies 
follow the same slope, deviating from 
the slope of perfect match (indicated 
by the heavy line) in such a way that 
the size range between largest and 
smallest coins is greater than it would 
be if the estimates were perfectly ac- 
curate. According to Myers and ac- 


cording to Carter and Schooler, the 
smaller coins are underestimated and 
the larger overestimated. According to 
Bruner and Goodman, all sizes are over- 
estimated, but slope and range are not 
thereby thrown out of agreement with 


the other two sources of data. In sum, 
three independent studies of children at 
about the fifth-grade level agree that in 
making size estimates of familiar coins 
from memory the children produce a 
correct order of coins according to size, 
and slightly exaggerate the differences 
between them. 

How should these facts be interpreted? 
Bruner and Rodrigues (5), fighting a 
kind of rear-guard action against Carter 
and Schooler in defense of some earlier 
conclusions, have argued that the rela- 
tively greater exaggeration of size in the 
estimates of coins of larger denomina- 
tion is further evidence for the hypothe- 
sis that symbolic value influences the 
estimate of physical magnitude. The 
argument is weak, and it seems to dis- 
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regard the fact that smaller coins may 
be underestimated; the dime, for in- 
stance, is underestimated, and under- 
estimated to a greater degree than 
either penny or nickel, according to 
Myers, and Carter and Schooler, as 
Fig. 1 and Table 2 show. Should we 
not say that memory, in stretching out 
the range of coin sizes, commits a 
“good” error? If we attend merely to 
individual coins, we note overestimation 
here and underestimation there; but if 
we attend to the whole series, we see 
that the order of sizes is maintained, 
perhaps guaranteed by the exaggeration 
of differences. After all, is it not, prac- 
tically, more important to realize that 
the dime is smaller than the penny than 
to know either the absolute sizes or 
the exact amount of difference between 
them? One would scarcely deny that 
symbolic value enters into the construc- 
tion of such a functional schema. But 
how? The exaggeration of differences 
would be more useful than sheer magni- 
fication of individual items, and this is 
what appears to happen. 

Perhaps it might be laid down as a 
general principle that, wherever it is im- 
portant to have clarity, there will be 
a tendency to exaggerate differences 
within any closely graded series, and 
thus, in a case like the present, lead to 
a slope steeper than the slope of per- 
fect match. In the case of series of 
few members and large differences be- 
tween adjoining members, there might 
be some lowering of the slope, since this 
could be afforded without the risk of 
confusion. But the coin series is a 
fairly closely graded one, and to keep 
them separate in memory seems to re- 
quire some exaggeration of difference. 

Before going any farther in the dis- 
cussion, let me explain a doubtful as- 
pect of Table 2, indicated by the ques- 
tion marks. In trying to put all avail- 
able information into identical form as 
millimeters of diameter, I had to treat 
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the percentages of deviation published 
by Bruner and Goodman according to 
two different assumptions. Since they 
are never specific about it, they could 
mean either area or diameter when they 
refer to “size.” Carter and Schooler ap- 
parently take them to mean diameter. 
The subsequent Bruner and Rodrigues 
article does not deny that assumption. 
Nevertheless, it is my guess that “size” 


Accuracy of memory for size of coins as shown by three independent studies. 


in the Bruner and Goodman article 
means area. My reason is that the di- 
ameters derived according to this as- 
sumption from their percentage figures 
harmonize better with the other data in 
Table 2, including those from Bruner 
and Rodrigues, than do diameters de- 
rived according to the other assump- 
tion. Incidentally, the meaning of the 
percentage figures in the Bruner and 
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Goodman study is troubled by the same 
ambiguity. If two investigators ob- 
tained precisely the same results but 
reported percentage deviations on the 
two different bases, one the one way 
and one the other, the one reporting on 
the basis of diameter would appear to 
have deviations roughly half as large as 
the one reporting on the basis of area. 


Tue ScHEMA CONCEPT 


In the preceding discussion of size 
estimates I once used the term schema, 
in what I believe is the sense recom- 
mended by Vernon in her recent topical 
and suggestive paper on perception. No 
doubt there are other terms that would 
do as well, all meaning some kind of 
cognitive disposition not in itself di- 
rectly experienced but manifesting itself 
in imagining, remembering, perceiving, 
and acting. This is not the place to 
ferret out the ancestors of the term and 
all the contemporary kith and kin. 
Suffice it to say that Bartlett in his 
book on remembering (1) took it over 
from Head, with whom he had discussed 
it for years, gave it greater generality 
and a more psychological orientation, 
and passed it on to others who found 
it useful, especially in Great Britain. 
The reader interested in more detail 
should turn to Bartlett (1), Oldfield 
(14), Oldfield and Zangwill (15), and 
follow up some of their references. The 
schema is organically built up out of 
past experiences and reactions, it func- 
tions as an orienting and stabilizing fac- 
tor in new contacts with the environ- 
ment, and, although it is constantly 
subject to change both because of or- 
ganismic dynamics and environmental 
variations, it may come to govern be- 
havior quite rigidly in exceptional cases. 
As Vernon says, “The essential point to 
note is that the use of a concept such 
as that of the schema reflects the fact 
that our percepts, thoughts, and behav- 
ior are on the whole consistent and 
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orderly—reasonably appropriate though 
not necessarily logical—and that they 
are consistently related to previous acts 
of perceiving, thinking, and behaving” 
(18, p. 181). 

Now, schemata appear to be inade- 
quately appreciated. Brunswik, for ex- 
ample, betrays surprise at the results of 
an experiment by Bolles and Bailey 
showing how extremely little the direct 
vision of familiar objects improves esti- 
mates of size based upon mere verbal 
identification alone (6). 

A perfectly accurate schema, of course, 
could not be improved upon by any 
amount of sensory inspection, and per- 
haps there are many schemata so nearly 
accurate that improvement would be 
difficult. But everyday observation sug- 
gests that a schema may be resistant to 
improvement even when there is plenty 
of room for it. One reason is that the 
presence of a schema obviates the ne- 
cessity of a close examination of ob- 
jects, so long as it works within the 
wide limits of tolerance usually per- 
mitted by the environment; the name 
of an object is about as informative as 
sensory commerce with it, if the sen- 
sory commerce functions chiefly as a 
reminder to the observer of the appro- 
priate cognitive reference file, as it often 
does. 

Such reflections lead to the hypothe- 
sis that accuracy of size estimation will 
be less for perceived objects belonging 
to a definite schema, unless the schema 
has been very carefully developed, than 
for perceived objects lacking such a 
schema. Figure 2 supports this hy- 
pothesis with data drawn from Carter 
and Schooler. Inspection of the coins 
does improve the estimates based upon 
the schema operating in memory, but 
the estimates for the cardboard discs, 
which presumably are less schematized 
than the coins, are more accurate still. 
The evidence is admittedly slender, but, 
such as it is, it is not contradicted by 





Corn PERCEPTION STUDIES 





SCHOOLER 


IN MM. 


oe 
uJ 
| a 
uJ 
= 
$ 
Qa 
fo) 
uJ 
a 
<= 
= 
e 
” 
WwW 





A 


DATA FROM CARTER & 








22 


ACTUAL DIAMETER 


Comparative accuracy of memory and immediate perception. 


Fic. 2. 


the data from Bruner and Goodman; if 
anything it is only too well supported. 
One expects sensory inspection to result 
in some improvement where the memory 
estimate is inaccurate; but, contrary to 
this expectation, just the opposite hap- 
pened in the Bruner and Goodman 
study, as nearly everybody knows. On 
the other hand, the estimation of card- 
board discs, both in this study and in 


24 26 28 
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the later one by Bruner and Rodrigues, 
was very accurate indeed. 


EMOTIONAL EFFECTS? 


I have argued that under ordinary 
circumstances one would expect esti- 
mates of size to be progressively better 
as one passes from an estimate based 
upon a memory schema, through one 
representing the interaction of schema 
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and direct sensory inspection, to one 
based primarily upon direct sensory in- 
spection and relatively little upon a 
schema, as illustrated in Fig. 2. But 
the hypothesis is not prepared to cope 
with emotional perturbations, however 
caused. 

It is not exactly clear what the con- 
ditions of the Bruner and Goodman ex- 
periment were from the point of view 
of the psychology of the subjects. The 
children were selected as representing 
two distinct economic classes, and it was 
assumed that the class membership im- 
plied a predictable state of need with 
regard to money, the poor being sup- 
posedly greedier for it than the rich, 
and tending, therefore, to overestimate 
coin sizes, especially when the coins had 
taken on burning reality by being placed 
within their very grasp. But this theo- 
retical position is modified in the Bruner 
and Rodrigues paper; it is confessed 
that class membership is an unreliable 
index of need state, and the reader is 
urged to put more trust in the type of 
approach “where a money-value system 
is built up and extinguished experi- 
mentally,” as in the study by Lambert, 
Solomon, and Watson (10). There was, 
apparently, no deliberate manipulation 
of the value system by Bruner and 
Goodman in their treatment of their 
subjects or in the experimental arrange- 
ments. The merest hint of such a 
variable is found in their manner of 
presenting the coins. These were pre- 
sented, verbally and materially, in as- 
cending and descending order of value 
rather than randomly or in order of size. 
Whether this procedure accounts for the 
slight tendency to dislocate the dime 
from its proper relative position in the 
size series is an open question. As to 
producing a general overestimation of 
size for all the coins, however, it would 
scarcely suffice. Were the children emo- 
tionally aroused in any way by the ex- 
perimenters when the coins were pre- 
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sented, as might conceivably have hap- 
pened if there had been, for instance, a 
promise of monetary reward? Appar- 
ently not. The conditions of the experi- 
ment were, presumably, of a neutral. 
strictly psychophysical kind, implying 
no reward and arousing no excitement 
except such as might have been derived 
spontaneously from the need states ap- 
propriate to the rich and the poor. The 
startling results, now that the class- 
membership hypothesis has been aban- 
doned, are left hanging without specific 
explanation. 

The possibility of emotion in the 
Lambert, Solomon, and Watson study is 
much clearer. Very young children (3 
to 5 years) were used as _ subjects. 
Those in the experimental group re- 
peatedly obtained a poker chip by turn- 
ing a crank a certain number of times 
and then got candy by inserting the 
poker chip into a slot; the control sub- 
jects likewise turned a crank and re- 
ceived candy, but without the inter- 
mediation of the poker chip. After ten 
days the experimental group, having 
been often reinforced (10 or 50 times), 
displayed a significantly higher size esti- 
mation of the poker chip than in pretest 
trials or after extinction; interestingly 
enough, there was also a slight rise in 
the control group. A repetition of the 
experiment, with certain modifications, 
produced similar results, as reported by 
Lambert and Lambert (9). Now, noth- 
ing is said in the Lambert studies about 
the emotions of these little children. 
Nevertheless, it may be fair to guess 
that the process of reinforcement gener- 
ated some degree of pleasant excite- 
ment, whereas the extinction experience 
may have been somewhat sobering. The 
former emotional condition might have 
been more conducive to an expansive 
sort of inaccuracy, especially in the 
presence of the stimulating poker chip, 
than the latter. At any rate, it is not 
unreasonable to look for an explanation 
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of the expanded estimates in the gen- 
eral pleasant emotion induced in both 
sets of children by the interesting and 
rewarding game, and especially induced 
in the experimental group by the fact 
that the poker chip which they had to 
judge was an intrinsic part of the game, 
suggesting by its presence still more 
candy or other exciting outcome. Per- 
haps this is what is meant by Lambert 
and Lambert when commenting on a 
variation of the experiment in these 
words: “Still another way of discussing 
these data is that there is an effect at 
least analogous to stimulus intensity in- 
volved. That is, the white token is a 
stronger stimulus, at least in its visceral 
prededications, and it results in an am- 
plification of the judgment response” 
(9, p. 509). One should not expect 
children of 3 to 5 years of age to be 
very critical in making the sensorimotor 
adjustments required in these experi- 
ments for indicating their judgments, 
when they are under the stress of pleas- 
ant excitement. That the error happens 
to come out in the direction of a larger 
estimate rather than a smaller may be 
an expression of an affective law, though 
at the moment I can think of no bet- 
ter evidence for such a law than the fa- 
miliar rough observations concerning ex- 
pansion of the body and bodily gestures 
in sthenic states, fluctuations of hand- 
writing size with changes of mood, the 
general overplus of activity in excited 
children, and so on. 


SUMMARY AND CONCLUDING REMARKS 


The following principal conclusions 
emerge from the present discussion: 
(a) The memory schema for coin sizes, 
while stretching out the total range and 
slightly exaggerating the differences be- 
tween coins, preserves the correct order; 
(b) there is no evidence that this order 
is significantly disturbed by values and 
needs; (c) a memory schema, such as 
that for coin size, tends to resist correc- 
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tion by sensory contact with objects, 
contact serving as little more than a 
fleeting reminder, like a name; (d) de- 
partures from accurate size estimation 
which have been taken to favor the hy- 
pothesis that increase of value induces 
phenomenal magnification might better 
be interpreted as the result of the inter- 
ference of expansive emotional states 
with meticulous adjustments. 

The above conclusions refer to cer- 
tain specific studies and to results stated 
in the form of averages. There is no 
intent to deny that values and needs 
may influence perception in some way, 
though presumably more through sche- 
mata than directly. The experimental 
evidence is weak, however, and very 
fragmentary. 

One final point may be made. If a 
strong case is ever to be developed for 
need and value effects on perception, it 
may be supposed that it will have to be 
founded on the careful analysis of indi- 
vidual data. Perceivers are always in- 
dividuals, and schemata, too, are indi- 
vidual matters, even though there may 
be percepts and schemata which are 
widely shared in a society or species. 
Perhaps it is not scientifically desirable 
to restrict ourselves so narrowly to av- 
erage data, to neat tabulations of meas- 
urements and tests of significance, while 
neglecting to describe fully the general 
behavior of our subjects, and neglecting, 
in fact, to study the pattern of data for 
each individual. Klein, Schlesinger, and 
Meister (8) have done us a service, I 
believe, in pointing out the overwhelm- 
ing importance of the individual differ- 
ences in their own value-perception 
study, and have properly emphasized 
that mass statistics may completely hide 
the value determinants which may be 
operating quite powerfully in individuals 
taken singly. 
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In the report on primary mental 
abilities found in his first large factor 
analysis, Thurstone listed a factor whose 
unique property he thought to be an 
ability to reason under restrictive con- 
ditions (11). Its largest loading was in 
a test of arithmetic reasoning. The same 
factor was not subsequently identified 
as such in his later analyses. 

In the AAF Aviation Psychology Pro- 
gram research during World War II, a 
factor that was found repeatedly as a 
consistent feature of arithmetic-reason- 
ing tests was called general reasoning 
(7). Because of the unusual variety of 
tests found related to the factor, no 
clear definition of it was achieved. It 
was concluded that the ability is a kind 
of “general-purpose ability” or “trouble- 
shooting ability,” which seems to come 
into play in a difficult test or one that 
contains some very difficult items. 

A hypothesis concerning this factor 
that would be appealing to some is that 
it is synonymous with general intelli- 
gence. Finding it in a large variety of 
tests would be one support for this idea. 
The fact that an arithmetic-reasoning 
test is a popular component of many 
intelligence tests would be another. 
Against the hypothesis, however, is a 
very stubborn fact. Analysis shows 
that many types of intelligence tests 
are not related to the factor; for some 
types, correlations are near zero. 

No one has seriously maintained that 
the factor is the only ability to reason, 
for a number of reasoning factors have 
been found. Our own results show that 


1 Based upon a paper presented at the an- 
nual convention of the American Psychologi- 
cal Association in San Francisco, September 
1-7, 1955. 


there are as many as five that can be 
called reasoning factors. 

At the beginning of the project studies 
of aptitudes at the University of South- 
ern California, we favored the hypothe- 
sis that general reasoning has something 
significant to do with problem solving. 
Although not all problems are cast in 
arithmetical form, and there are indeed 
a great variety of problems, we tended 
to follow traditional thinking to the ef- 
fect thai underlying the solving of prob- 
lems in general there is a common psy- 
chological core—that we can speak of 
problem solving in a generic sense. We 
have to admit that we do not know 
much about the nature of this phe- 
nomenon, but general reasoning might 
be one of the common elements. 

Our first analysis of reasoning abili- 
ties began with five alternative hypothe- 
ses regarding the nature of general rea- 
soning (2). The factor might be: 

1. A general ability to manipulate 
symbols. This would make the ability 
perhaps as broad as thinking itself, thus 
going beyond problem solving. 

2. A general ability to solve prob- 
lems. This narrows the conception to 
a more reasonable level. 

3. An ability to define problems. This 
concentrates attention on the first steps 
of problem solving, and stops short of 
the actual solving activities. 

4. An ability to test hypotheses. Hy- 
pothesis testing occurs during the defin- 
ing of a problem and also at later stages. 

5. An ability to organize a sequence 
of related steps. An arithmetic-reason- 
ing problem that is at all complex re- 
quires a sequence of numerical opera- 
tions to solve it. We know that the 
actual numerical operations depend upon 
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the factor of numerical facility. An 
arithmetic-reasoning test is commonly 
loaded on both this and the general- 
reasoning factor. 

Our first two factor analyses (2, 5) 
eliminated the first hypothesis, in agree- 
ment with previous analyses. Not all 
tests involving symbols are loaded on 
it. Furthermore, we found a new abil- 
ity to manipulate symbols on a re- 
stricted scale. That is, it was confined 
characteristically to tests whose items 
approach equation form. 

A recent analysis of planning abilities 
(4) has shown a factor that could be 
called “ordering ability,” which in a 
way takes care of Hypothesis 5, for the 
factor is separate and distinct from 
general reasoning. 

Results have more and more pointed 
to the elimination of Hypothesis 2, that 
the factor is a general problem-solving 
ability. There are too many tests in- 
volving problems in which the factor is 
absent. This is especially true where 
the test material is composed of figures 
rather than words and/or numbers. We 
have come to the view that a common, 
unique, psychological core for all prob- 
lem solving does not exist (3). Prob- 
lems are simply too varied, and each 
type seems to call upon its own pattern 
of abilities—perceptual abilities as well 
as thinking abilities. 

As we have gone from one analysis 
to another, our preferred hypothesis 
concerning the factor has undergone 
changes, without reaching a final reso- 
lution. The interpretation that was fa- 
vored after each successive analysis can 
be briefly stated: 

6. Effectiveness of trial-and-error ma- 
nipulation of symbols in the solving of 
problems of certain kinds (2). This 
gives much attention to the fact that 
problems that are so difficult as to in- 
volve the examinee in somewhat random 
attempts are the best material for meas- 
urement of the factor. 


7. Ability to consider a number of 
conditions in defining or understanding 
a problem (5). 

8. Ability to size up a test (or prob- 
lem) situation to see what is demanded 
by it (12). 

9. Ability to structure a problem (8). 

10. Ability to find and maintain a 
problem structure when considerations 
are complex (4). 

Examination of these suggestions 
shows that the original Hypothesis 3, 
that the factor is an ability to define 
problems, is very persistent. We also 
see that two new ideas have entered the 
picture. One emphasizes a possible trial- 
and-error aspect and the other empha- 
sizes the complexity of the task. The 
latter idea suggests a span concept. In- 
dividuals may differ with respect to the 
level of complexity they can handle in 
thinking. The complexity may be cali- 
brated ideally in terms of the number 
of elements, relations, or variables that 
the problem entails. 

A very recent analysis (6), which was 
concentrated on this factor, attempted 
to reach some decision among the three 
alternatives; that the factor is an abil- 
ity (a) to understand or define a prob- 
lem, (4) to think effectively by trial 
and error, and (c) to handle a complex 
set of elements, relations, or variables. 

Tests designed to measure ability to 
understand problems concentrated on 
the first stages of solving arithmetic- 
reasoning problems. Given a question, 
what facts are or are not essential to 
finding an answer? Given a problem, 
what arithmetical operations are essen- 
tial? Tests emphasizing trial and error 
presented problems in which trying one 
thing after another would be a natural 
mode of attacking the task. Tests em- 
phasizing complexity either required the 
handling of many elements in every 
item or increased the number progres- 
sively throughout the test. It will have 
to be admitted that experimental sepa- 
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ration of these features between sets of 
tests was not completely achieved. 

The results throw considerable doubt 
on the trial-and-error hypothesis but do 
not eliminate it from all future consid- 
eration. The three tests especially de- 
signed to emphasize trial and error were 
not loaded on the factor. 

One of the four tests designed to em- 
phasize complexity had a relatively high 
loading on the factor, in fact the high- 
est for any test in this analysis. This 
test is known as Ship Destination. In 
each item the problem is to tell how 
many miles a ship is from a port on a 
simple map. New conditions that modify 
the effective distance are added as the 
test progresses—wind direction, current 
direction, initial heading, and strength 
of wind and of current. In a sense, the 
test belongs in the category with arith- 
metic reasoning. Structuring the prob- 
lem would still be important. 

Two of the three tests designed to em- 


phasize definition of problems had sub- 


stantial loadings on the factor. The 
third had shown some relationship to 
the factor in two earlier analyses. It 
had been revised in the meantime, how- 
ever, with an effort to slant the test to- 
ward some other factor. Perhaps this 
effort had succeeded too well. 

Thus, we are forced back upon the 
hypothesis having to do with under- 
standing problems. Just what kind of 
understanding is important and what 
kinds of problems, in addition to those 
of the arithmetic-reasoning type, are still 
to be determined. 

During the course of our investiga- 
tions, other hypotheses have been sug- 
gested from other sources. In his mono- 
graph on aptitude factors, French calls 
the factor “deduction” (1). There are 
one or two other reasoning factors that 
have greater claim to this name and 
that have come out in analyses, distinct 
from general reasoning. This hypothe- 
sis is thus easily eliminated. 
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More recently Lucas and French (10) 
have suggested that the factor is the 
ability to do quantitative or mathemati- 
cal reasoning. Under this hypothesis, it 
would be difficult to explain why the 
factor is consistently loaded in tests 
that present problems of a nonmathe- 
matical and nonquantitative type (such 
as reading comprehension) and in which 
it is difficult to see how numbers and 
mathematical thinking would help. We 
also have evidence from our project re- 
search that there is sometimes little or 
no relation of general reasoning to cri- 
teria of achievement in college mathe- 
matics (9). We must therefore also re- 
ject this hypothesis. 

In conclusion, we may say that it has 
been much easier to decide what general 
reasoning is not than to say what it is. 
We have given reasons for believing that 
it is not general intelligence, though it 
is undoubtedly one of the important in- 
tellectual abilities; it is not a broad 
ability to manipulate symbols, but there 
seems to be a narrow ability of this 
sort; it is not a generic ability to solve 
problems, though it undoubtedly plays 
a role in solving problems of certain 
types; it is not an ability to order a 
sequence of steps, but there is an order- 
ing ability, which may be of some im- 
portance in planning; it is not an ability 
to use trial and error in problem solv- 
ing; and it is not a general idea span, 
though the span concept may be found 
to apply in some limited way yet to be 
determined. 

By elimination and by consistent in- 
dications of a positive nature, the best 
we can say is that general reasoning has 
something to do with comprehending or 
structuring problems of certain kinds in 
in preparation for solving them. The 
range of problems that are pertinent is 
still to be determined. Whether this 
ability extends beyond problem solving 
is also to be determined. It may be a 








general ability to formulate complex 
conceptions of many kinds. 
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It is clear that an increasing number 
of theories of personality find it useful, if 
not necessary, to employ some construct 
referring to the psychological situation 
—e.g., life space (10), phenomenal field 
(13, 15), or meaningful environment 
(14). Such theories do not attempt to 
relate behavior to the physical or geo- 
graphic environment. Instead, they at- 
tempt to make predictions by reference 
to the environment as it is perceptually, 
cognitively, or functionally responded 
to by an organism or class of organ- 
isms. In view of the generally recog- 


nized importance of such theoretical ef- 
forts, on the one hand, and in view of 
certain questions that have been raised 


about the methodological status of these 
efforts, on the other, it would seem op- 
portune to examine the issues involved 
in some detail. 

The most frequently cited challenges 
to phenomenological’ orientations in 
personality theory have questioned the 
degree to which such orientations may 
be considered to be truly physicalistic, 
and the degree to which they may be 
considered to be predictive rather than 
postdictive systems. These issues are, 
of course, not unrelated to each other. 
Lewin’s system, perhaps because of the 
explicitness with which it is presented, 
has served most often as the prototype 
of phenomenological theories when these 


1The term phenomenological and its vari- 
ants are used in this paper to refer to theories 
which employ some construct referring to the 
psychological situation, eg., life space, be- 
havioral environment, etc. The present usage 
divests the term of certain historical connota- 
tions, such as holism or introspectionism, 
which have accrued to it. 


issues have been discussed. Brunswik 
(4), for example, describes Lewin’s life 
space as postperceptual and prebehav- 
ioral, and doubts whether Lewin’s pre- 
dictions can, in a strict sense of the 
word, be tested. Spence (16) has 
stressed the failure of such field theo- 
ries as Lewin’s to provide us with laws 
that will enable control and manipula- 
tion of the behavior-determining psycho- 
logical field. His analysis has gone fur- 
ther to emphasize the distinction be- 
tween the types of laws that are 
achieved by phenomenological theories 
versus those attained by systems which 
do refer behavior to the physico-geo- 
graphic environment. The former laws 
are called R-R laws, and are considered 
to relate one set of responses to another 
set of responses obtained earlier from 
the subject or the experimenter. The 
latter laws are referred to as S-R laws, 
and are considered to relate responses 
to independently measured physical and 
social environmental variables. 

The aims of this paper are three- 
fold: to support the position that phe- 
nomenological theories of personality are 
compatible with the general scientific 
requirements of physicalism and pre- 
dictiveness, to demonstrate that a fun- 
damental methodological distinction be- 
tween these theories and nonphenome- 
nological theories is difficult to maintain, 
and, finally, to suggest that a crucial 
issue intrinsic to a discussion of the 
problems mentioned above—and one 
rarely raised in that connection—is the 
nature of the psychological data lan- 
guage traditionally favored by S-R and 
by R-R theories. 
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THE PROBLEM OF THE PSYCHOLOGICAL 
ENVIRONMENT 


The essential characteristic of all con- 
structs which refer to a psychological 
environment is that their nature and 
properties are response-inferred. It is 
unfortunate that some usages of such 
response inferences have been of the 
sort as to couple them closely in time 
to the behavior that is being predicted. 
This has given the impression of a very 
limited kind of predictiveness, if not, in 
actuality, a kind of postdiction. It has 
also raised the related question of the 
independence of the two sets of re- 
sponses. Since the problem of predic- 
tion and the problem of independence 
of variables are both closely bound up 
with the problem of the definition of 
the stimulus, the latter will be reviewed 
at some length. In this way a clearer 
picture of the methodological status of 
the psychological-environment construct 
should emerge. It may be noted, par- 
enthetically, that Lewin’s theory, as an 
example of phenomenological theories, 
is considered by Spence (16) to achieve 
laws between independent variables; the 
emphasis of his remarks is on the point 
that the variables are all response vari- 
ables. 

The term stimulus in psychology im- 
plies, by definition, a relatedness to 
responses rather than independence of 
them. It is impossible to isolate, point 
to, or describe aspects of the environ- 
ment as stimuli except insofar as they 
have some determinable relationship to 
the responses of some organism. To 
conceive, therefore, of stimulus vari- 
ables as independent of response vari- 
ables in general appears to be unwar- 
ranted. This point has been raised in 
connection with other problems by both 
Brown (2) and Bakan (1). Bakan has 


noted that variables are simply sets of 
categories, and “categories are the re- 
sult of someone’s delineation, abstract- 
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tion, and identification” (1, p. 47). In 
this sense there are no stimulus vari- 
ables which are not response-inferred or 
defined, including the variables of phys- 
ics and sociology. The organism which 
is the reference point for the process of 
definition in physics is, of course, the 
experimenter. It follows from these 
considerations that all stimuli are re- 
sponse-defined at some time by some 
organism. The issue becomes that of 
whose response definitions, under what 
conditions. 

There are two other ways in which 
the meaning of the term independence 
may be considered. 

1. A distinction may be attempted 
between the responses of the subjects 
and those of the experimenter, that is 
to say, the stimulus definition by the 
experimenter may be considered inde- 
pendent of the responses of his subjects. 
Davis (8) has presented this position 
in its extreme in his recent call for a 
more physical psychology. Davis states: 
“For a ‘stimulus’ (external event) to 
qualify under the proposed canon, it 
would have to be something which an 
experimenter could ascertain without 
there being any organism for it to work 
on” (8, p. 10). This statement is 
worth examining closely. In view of 
our previous discussion, it may be 
noted that Davis concedes, at least, 
that the stimulus is defined by the re- 
sponse of the experimenter, i.e., “some- 
thing which the experimenter could as- 
certain.” More appropriate to the pres- 
ent point, however, is the fact that the 
word “stimulus” is placed in quotes by 
Davis. Obviously, this evidences rec- 
ognition that external events may be 
considered stimuli only when some re- 
lationship is demonstrable between the 
presence or occurrence of the event and 
some behavior or response of an or- 
ganism. 

If this view were not held, in actu- 
ality, by psychologists, the result would 
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be the investigation and manipulation of 
an infinite array of physically discrimi- 
nable variables (discriminated by the 
experimenter), some of which may have 
effects on organism—e.g., a loud noise 
—and some of which may not—e.g., a 
sound of a frequency too high to be 
directly perceived by human subjects. 
Even the traditional S-R theorists ma- 
nipulate variables which are defined by 
the responses of their rats rather than 
by themselves alone. The empirical 
law of effect is a concept which illus- 
trates how classes of noxious stimuli 
and incentives are defined by the ap- 
proach or avoidance behavior of popu- 
lations of rats. To summarize this 
point, it is possible for experimenters 
to discriminate or identify variables in- 
dependently of the responses of their 
subjects, but these variables may be 
considered as stimuli only when they 
are functionally related to, i.e., defined 
by, the responses of their subjects. 

2. The second sense in which the in- 


dependence of the stimulus may be con- 
sidered is in terms of its independence 
from a given or particular response. 
Stated otherwise, the issue is whether 
the stimulus may be identified before 


the particular response occurs. This is 
a crucial issue, since it involves the en- 
tire process of prediction and control. 
Most of the phenomenological systems 
have often been content to identify the 
stimulus only after the given response 
has occurred and, more seriously, have 
not adequately provided or described 
techniques or procedures for enabling 
the experimenter to predict behavior by 
knowledge of the nature of the stimulus 
in advance of the occurrence of the par- 
ticular response. Spence (16) is cor- 
rect in his criticism of such phenome- 
nological approaches for not telling us 
what to do to an individual in order to 
manipulate, change, or control his be- 
havior. This criticism, however, applies 
to phenomenologcial systems such as 
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those of Lewin, Snygg and Combs, and 
Rogers, not because they utilize re- 
sponse-inferred constructs but rather 
because they have traditionally and 
uniformly rejected a historical approach 
to the problem of prediction. It is only 
an ahistorical phenomenology which has 
limited predictiveness, or is essentially 
postdictive. A historically oriented phe- 
nomenology (e.g., 14) enables the ex- 
perimenter to predict behavior by pro- 
viding a basis for advance knowledge of 
the stimulus. 

Let us examine this latter point in 
greater detail. The phenomenologists 
have maintained that behavior is a 
function of the psychological situation 
(10), the stimulus functions (9), or 
the meanings of stimuli (14), etc. The 
problem for them is to know, in ad- 
vance of the given response, how the 
subject will constitute the stimulus situ- 
ation, i.e., what meaning it will have 
for him. Since the meaning of a stimu- 
lus, or the precise way in which a vari- 
able functions as a stimulus, is inferred 
from the responses of the subject, it 
follows that, for the prediction of a 
given response in a particular situation, 
the stimuli must be inferred from previ- 
ous responses of the subject, or similar 
subjects, in similar or systematically re- 
lated situations. 

This is precisely the way in which 
S-R behavior theorists proceed to de- 
fine their stimuli in advance of the be- 
havior of their rats. The way in which 
it is predicted that a pellet of food in a 
T maze will be an incentive stimulus 
for a hungry rat, and will result in ap- 
proach and eating behavior, is simply 
by having observed that this rat, or 
other similarly hungry rats, previously 
approached and ate the food pellet. 

For a similar approach to the prob- - 
lems of prediction in personality, it is 
possible to mention several well-known 
procedures which enable specification of 





176 


the stimulus situation independent of 
and prior to a given response. 

1. Verbal report, by either the sub- 
ject or the experimenter. Spence has 
pointed out that “. . . the phenomeno- 
logical approach has its advantages, par- 
ticularly in the complex field of social 
behavior of the human adult. It is ob- 
viously much easier to gain some notion 
as to the relevant variables determining 
such complex behavior by asking the 
individual to verbalize than it is to em- 
ploy the procedure of trying to hy- 
pothesize them from knowledge of past 
history” (16, p. 57). Unique to hu- 
man beings is their ability to verbalize 
their perceptions of situations or the 
meanings that they have for them. 
This information, in spite of its well- 
known limitations, is exceedingly valu- 
able to the personality theorist inter- 
ested in predicting the subject’s subse- 
quent behavior in those situations. And 
as Brunswik (5) has pointed out, it is 


possible to utilize verbal reports with- 
out falling back upon introspectionism. 

Psychological tests may be consid- 
ered as a specific example of this gen- 


eral procedure. Test data are essen- 
eially self reports or verbalized percep- 
tions of stimulus situations. They are 
highly useful in attaining knowledge of 
the meaning or perception of future or 
subsequent real-life situations by the 
subject, to the extent that the test situ- 
ation is similar to or systematically co- 
ordinated to the criterion or predicted 
situation. 

2. Observation of the responses of 
the general culture group to which a 
given subject belongs. It is possible to 
define a stimulus situation for a given 
subject on the basis of previous re- 
sponses by the culture group to which 
the subject belongs. Obviously, the ac- 
curacy or the applicability of such a 
culture-group definition for a given sub- 
ject depends upon his relatedness to 
that group, or the similarity of his past 


RICHARD JESSOR 


experiences to those of the group mem- 
bers. Snygg and Combs (15), for ex- 
ample, point out that people who share 
common roles in a common culture de- 
velop common characters in their phe- 
nomenal fields and consequently in their 
behavior. In short, similar social learn- 
ing will result in similar perceptions of 
environments. 

It will be seen that the broader the 
reference group, the less will be the like- 
lihood that its definition of the situation 
will be useful or accurate for a given 
individual. If one were interested in 
the meaning of a classroom quiz situa- 
tion for a particular college sophomore 
who is also a fraternity member, the 
responses of the reference group of 
college sophomore fraternity members 
might be more applicable than those of 
college students as a whole, which in 
turn might be more accurate (that is, 
predictive) than those of the middle 
class as a whole, etc. 

3. The clinical method. This leads 
to the third procedure, which assists us 
in coordinating idiographic and nomo- 
thetic approaches. In the preceding 
paragraph the use of culture-group defi- 
nitions of situations was advocated. 
Since such definitions will apply only 
more or less to given subjects, they are 
useful for predictions about differences 
between groups of individuals, but will 
be unsatisfactory where specific indi- 
vidual prediction of behavior is desired. 
In the latter case it becomes necessary 
to reduce the reference points to the 
unique past experiences of the particu- 
lar subject. By intensive study of the 
subject’s previous responses in various 
situations, the personality theorist is 
able to establish the meaning or poten- 
tial perception of a situation for a spe- 
cific subject, and hence to predict sub- 
sequent behavior. This procedure, of 
course, is the clinical method, and the 
present description of it emphasizes its 
continuity with nomothetic procedures. 
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More concrete elaboration of the impli- 
cation of these three general procedures 
for defining situations will be made 
shortly. 

The preceding discussion has stressed 
the point that all stimuli are response- 
defined at some time by either the sub- 
ject or the experimenter. In view of 
this homogeneity, it seems unwarranted 
to establish a fundamental logical or 
methodological distinction between S-R 
and R-R theories. Viewed in this light, 
the use of constructs referring to a psy- 
chological environment would seem to 
require no special justification. The dis- 
cussion to this point has also delineated 
at least the potential compatibility of 
phenomenological theories with the nec- 
essary demand for predictive adequacy. 


THE NEED FOR A PSYCHOLOGICAL 
Data LANGUAGE 


Granting the argument to this point 
does not at all lead to the conclusion 
that S-R and R-R theories do not differ 
in some significant way. The question 
to be asked is, Wherein is there a dif- 
ference between these types of theories? 
The answer seems to lie in the data lan- 
guage that is favored to describe the 
situation or environment in which be- 
havior takes place. 

There is common agreement among 
various theorists that psychology thus 
far has neglected the development of an 
adequate descriptive terminology for the 
environment. While personality theo- 
rists have been interested for a long 
time in categorizing the behavioral or 
internal states of human subjects, they 
have paid far less heed to developing 
categories for describing different kinds 
of situations. Such attempts as have 
been made are suggestive, but remain 
limited in usage to particular theoreti- 
cal orientations, and no widely accepted 
or universal descriptive terminology for 
the environment exists at present. It 
seems to the present author that one 
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difference between the S-R theories and 
the phenomenological or R-R theories 
is that the former have heavily em- 
ployed the data language of physics 
(despite the reliance on such terms as 
cue and goal), whereas the latter have 
tended toward a more psychological 
data language. The reliance by the 
former on physical terms seems to be 
one of the factors which has contributed 
importantly to the appearance of in- 
dependent stimulus definition that was 
discussed above. 

Undeniably effective use has been 
made by S-R theory of the language of 
physics. Such a data language has the 
tremendous advantage of being one in 
which, for historical reasons, maximum 
reliability or interobserver agreement 
can be attained, and which involves con- 
cepts for which accurate measures have 
been developed. Nevertheless, reliabil- 
ity alone is ins’ ‘ficient qualification for 
any language s,stem. What is required 
is that it be adequate for the purposes 
and problems of a given science or level 
of description. Since the real world is 
neutral with respect to language sys- 
tems, there is no a priori reason, beyond 
degree of reliability, for the use of the 
data language of physics by another sci- 
ence such as psychology. The remain- 
der of this paper will attempt to point 
out possible reasons why the language 
of physics has proved unsatisfactory for 
personality theory, or for psychologists 
dealing with complex human social be- 
havior, and proposes the development of 
a psychological data language. 

It may forestall misunderstanding to 
state at this point that our quarrel is 
not with methodological physicalism, 
ie., the insistence on objectivity via de- 
notative reduction or observational re- 
liability. Rather, the critique is aimed 
at the incorporation into psychology of 
the language of physics. The latter 
procedure is part of what Brunswik 
has designated as thematic physicalism, 
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“. .. the uncritical emulation of the 
. aims and problem content of phys- 

ics by other disciplines” (5, p. 14). 

Methodological physicalism in no way 
implies that the various branches of sci- 
ence must use the same language of de- 
scription, more specifically the language 
of physics. Carnap (6) indicates recog- 
nition of the necessity for terms other 
than physical ones in his distinction be- 
tween physics and biology. He notes, 
with respect to the latter science, that 
“The terms which are used in this field 
in addition to logico-mathematical and 
physical terms may be called . . . bio- 
logical terms” (6, p. 412). Such terms, 
as well as psychological terms to de- 
scribe situations, must, of course, be re- 
ducible to observable thing-predicates, 
i.e., to terms which designate properties 
which can be determined by direct ob- 
servation. It is in this sense that Car- 
nap discusses the unity of the language 
of science, while recognizing the ab- 
sence of unity of scientific laws. It is 
congruent with such a position to call 
for laws which are clearly psychological, 
i.e., statements of relations between psy- 
chological terms. 

The problem of a data language other 
than a purely physical one is much more 
acute for the phenomenological theories 
dealing with humans than for the S-R 
theories which have dealt largely with 
lower organisms. (a) The situations in 
which human social behavior takes place 
are far more complex than the typical 
rat-learning situation. For the latter it 
has been adequate thus far to employ 
the language of physics, since the ma- 
nipulated stimulus variables are often 
simple physical dimensions such as 
amount of illumination, intensity of 
electric shock, etc., and since highly 
restricted and controlled situations are 
characteristically employed in such re- 
search. We do not imply that S-R ap- 
proaches have utilized only the language 
of physics; we are maintaining that a 
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large number of terms from that disci- 
pline are employed, and that when ex- 
tension to complex human social situa- 
tions is undertaken, such terms will 
prove inadequate, and more appropriate 
language will have to be invented. How 
to describe the situation where a col- 
lege student is asking a coed for a date 
is an example; description in terms of 
the amount of illumination, atmospheric 
pressure, etc., will contribute little to a 
predictive analysis of such a complex 
situation. The Hawthorne studies (12) 
provide an excellent example of how a 
physical description of the stimulus 
situation in terms of changes in illumi- 
nation was inadequate to account for 
the direction of changes in worker pro- 
ductivity. (b) Relatively greater vari- 
ability of the human response repertoire 
would seem to be another reason for 
failure of the language of physics in 
personality theory. A wide variety of 
potential responses implies a multiplic- 
ity of potential definitions of the same 
situation by different subjects. (c) 
The presence in humans of language 
and memory, which allow for the sym- 
bolization of variables not physically 
present in the situation though insti- 
gated by the present situation, would 
seem to be a further complicating 
factor. 

It is proposed that one of the immedi- 
ately pressing tasks for psychology to 
undertake is the development of an ade- 
quate psychological data language to 
describe the environment. This is a 
language whose terms have reference to 
the nature (behavior) of some organ- 
ism or class of organisms, since the 
existence and nature of stimuli depend 
upon the characteristics, needs, habits, 
expectancies, intelligence, etc., of or- 
ganisms. It represents a different ab- 
straction from and organization of the 
basic referential level, or level of ob- 
servable thing-predicates, than does the 
data language of physics. A simple ex- 
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ample is the term goal, which presup- 
poses a directed or striving organism. 
The same situation, as described physi- 
cally, would be described differently in 
psychological terms for different organ- 
isms or classes of organisms. Some 
tentative beginnings in this direction 
are referred to below. 

Rotter (14) has proposed that situa- 
tions be described by their cultural 
meanings in terms of the characteristic 
reinforcements or goals which are likely 
to occur in those situations. Since no 
situation always provides just one kind 
of a goal, situations may be character- 
ized as mixed, or may be described in 
terms of the dominant or usual or most 
frequent goals likely to occur. Situa- 
tions may also be described as am- 
biguous or unfamiliar where the ex- 
pectancies for the occurrence of any 
particular goal are low. Thus, for pre- 
diction at the level of personality, 
situations may be described as love and 
affection situations, the college quiz 
may be characterized as an academic 
recognition situation, etc. Rotter goes 
on to state: 


In this sense people, too, may be thought of 
as situations; it makes good sense to speak 
of authority figure situations, heterosexual 
situations, and so forth when these terms 
imply that a particular kind of reinforcement 
is likely to occur (14, p. 202). 


Murray (11) has presented a set of 
terms belonging to the category of Press, 
which describes the environment or 
stimulus situation according to the kind 
of effect—facilitating or obstructing— 
that it is exerting or could exert upon 
the organism. Lewin’s (10) terms, such 
as valence, barrier, etc., are also of 
this sort. 

Chein (7), too, has suggested a geo- 
behavioral language, i.e., a set of con- 
cepts to describe the real world looked 
at from a point of view that is con- 
cerned with understanding behavior. 

With these examples in mind, the 
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question arises as to the reliability or 
intercommunicability of such descrip- 
tions. But this would seem to be the 
only question which legitimately may 
be raised, despite any feelings of dis- 
comfort over these psychological terms. 
That intersubjectivity of such descrip- 
tions can be achieved may be illus- 
trated from a host of psychological re- 
search. Mention here may be made of 
a series of researches (14) carried out 
in relation to Rotter’s social learning 
theory, in which characterizations of 
experimental situations in terms of the 
potential goals they involve for college 
students have led to predictions of be- 
havior which have consistently received 
empirical support. 

The development of a psychological 
language of description might well be 
enhanced by following the suggestion of 
McClelland (3, p. 145) that a phe- 
nomenological census be carried out to 
discover what things and attributes in 
the environment people look for and 
attend to in guiding their behavior. 
Fruitful concepts may also be derived 
from sociology and anthropology, which 
disciplines have concentrated heavily 
upon descriptions, with respect to hu- 
man behavior, of social or cultural 
situations. Such descriptive terms will, 
of course, vary in their relevance for 
different culture groups and especially 
for different individuals. 

It is the increased and explicit utili- 
zation of a psychological data language 
which enables a rapprochement between 
S-R and R-R or phenomenological theo- 
ries of personality. Although it may be 
maintained that all psychological laws 
are fundamentally R-R laws, neverthe- 
less it does make better sense to speak 
of S-R laws as the goal of psychology 
when the definition of S is made in 
psychological terms, and the issue of 
independence of §S is restricted to in- 
dependence from a particular response 
in a given situation. 
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This formal theory is a small part of 
the later stages of a program of em- 
pirical research on social influence.’ It 
tries to integrate previous findings into 
a logically consistent theory from which 
one can derive testable hypotheses to 
guide future research.2 The more spe- 
cific purpose of the theory is to explore 
the extent to which the influence proc- 
ess in groups can be explained in terms 
of patterns of interpersonal relations. 

In discussing the effects of the ma- 
jority on conformity by the individual 
deviate, Asch states, “The effects ob- 
tained are not the result of a summa- 
tion of influences proceeding from each 
member of the group; it is necessary to 
conceive the results as being relation- 
Both 


ally determined” (15, p. 186). 
Heider (18) and Newcomb (29) have 
treated patterns of opinion and of inter- 
personal relations as a single system of 
relations, though they have discussed 


only two-person groups. The present 
theory reduces the process of influence 
in N-person groups to a summation of 
interpersonal influences which takes into 
account three complex patterns of rela- 
tions: (a) the power relations among 
members of the group, (d) the com- 
munication networks or patterns of in- 
teraction in the group, and (c) the re- 
lations among opinions within the group. 
Thus propositions which have been con- 


1The work reported in this paper was 
financed in part by a grant from the Rocke- 
feller Foundation and by a contract with the 
Group Psychology Branch of the Office of 
Naval Research. 

2Similar current attempts to construct 
mathematical theories of social influence in- 
clude unpublished papers by Ardie Lubin, by 
Harold Guetzkow and Herbert Simon, and 
by Solomon Goldberg. 


ceptualized at the group level (e.g., 
that the strength of group standards in- 
creases with increasing cohesiveness of 
the group) are deduced from concepts 
at the interpersonal level. 

The deductive power and the internal 
consistency of a mathematical model 
stem from a set of explicit definitions 
and postulates stated with enough pre- 
cision so that one can apply the rules of 
logic. But the construction of theory 
by coordinating mathematical definitions 
and postulates to psychological con- 
structs and assumptions leads to a di- 
lemma: the very precision which gives 
power to the theory also tends to over- 
simplify it. For reasons of mathemati- 
cal convenience one tends to make sim- 
ple assumptions which so restrict the 
theory that it may seem unrealistic com- 
pared to the complexity observed in so- 
cial behavior. Game theory, for exam- 
ple, describes certain aspects of how 
“the rational economic man” ought to 
behave, but actual economic behavior 
often departs widely from this simple 
ideal (20). 

The present theory deals with this di- 
lemma partly by utilizing a kind of 
mathematics, the theory of directed 
graphs, which does not require the mak- 
ing of precise quantitative assumptions 
about empirical variables.* In addi- 
tion, the basic concepts and postulates 


8 The theory of directed graphs, which is an 
extension of graph theory (16), has been 
studied by Frank Harary and Robert Norman 
with a view toward utilization by social sci- 
entists. A publication of this work is planned 
for the near future (17). The author is in- 
debted to these mathematicians for specific 
help in proving the theorems of this theory as 
well as for their work on the theory of 
digraphs upon which it is based. 
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of this theory were chosen to conform 
to the results of experiments on social 
influence. Frequently, however, our 
present knowledge was not adequate for 
making these choices in precise detail. 
At these points we attempted to choose 
postulates which would be essentially 
correct in their main outlines even 
though some details would have to be 
changed as new empirical knowledge 
accumulates. It is not surprising, there- 
fore, that many of the theorems are 
quite similar to previous findings about 
influence on opinions and _ attitudes, 
even though no research has been done 
specifically to test this theory. Never- 
theless we have intentionally oversimpli- 
fied the process of social influence by 
omitting many important determinants 
and by making very restrictive assump- 
tions about others. It seemed wise to 
start by examining the implications of 
a small number of postulates before 
proceeding to more complex theories. 


Tue Mover 


Following the theory of quasi-station- 
ary equilibria of Lewin (23), changes in 
opinion, attitude, or judgment are con- 
ceptualized in terms of forces operating 
along a unidimensional continuum (5, 
12). Social influences are coordinated to 
force fields induced by person A on per- 
son B; and the strength of these forces 
is assumed to vary with the power of A 
over B. The potential force field corre- 
sponding to this power relation will be 
actualized only if A communicates to B 
or interacts with him. When A expresses 
his opinion or argues for it in a way 
that influences B, then the force field 
operating on B has a central position 
corresponding to A’s position along the 
continuum of opinion. All the forces 
operating on B are directed toward this 
central position, so B will tend to change 
his opinion in a direction which brings 
him closer to A. Similarly, other mem- 
bers, C, D, E, etc., who communicate 
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to B may set up force fields on him with 
central positions corresponding to their 
own opinions. The actual changes in 
B’s opinion will be in accordance with 
the resultant force from all these in- 
duced forces plus a force corresponding 
to his own resistance. 

In order to derive the exact amount 
of influence that each member will have 
on the opinion of every other, let us as- 
sume that we are dealing with a unidi- 
mensional continuum of opinion which 
can be measured with a ratio scale. We 
might think, for example, of the classic 
experiment on social norms by Sherif 
(32), where the members of the group 
were asked to state their opinions about 
how many inches the light moved as 
they viewed the autokinetic effect. We 
shall denote the members of the group 
by A, B,C, . . . and their initial opin- 
ions by a, b,c, . . . respectively, where 
a is the distance of A’s opinion from the 
zero point on the scale. The abscissa 
of Fig. 1 shows such a scale of opinion 
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Fic. 1. The force fields influencing opinion. 


together with the initial opinions of A 
and B. The ordinate indicates the 
strength of the forces. The gradient of 
forces around A represents the forces he 
can induce on B to agree with his opin- 
ion, while the gradient of forces around 
B represent his tendency to resist chang- 
ing his opinion. Where these two gradi- 
ents intersect, a distance of 4(a + d) 
from the origin, there is an equilibrium 
point where the two forces are equal in 
strength and opposite in direction. At 
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all points to the right of this equilibrium 
the forces induced by A are stronger 
than B’s resistance, so B will move to- 
ward the point of equilibrium. Con- 
versely, at all points to the left of the 
equilibrium, B’s resistance forces are 
stronger than A’s inductions, so B will 
still move toward the point of equilib- 
rium. Similar calculations of the re- 
sultant force and consequent changes of 
opinion can be made for A and for 
groups with any number of members by 
placing all members on the same scale 
and by assuming that the gradient of 
forces around each member represents 
both forces he can induce on others and 
forces he can set up as resistance against 
others. 

The process of influence in a group 
takes place gradually over a period of 
time. As one member changes his po- 
sition and begins to influence others to- 
ward his new position, the force fields 
corresponding to his influence will also 
shift their central positions. It will be 
convenient, therefore, to divide the in- 
fluence process into a sequence of units 
defined in terms of opinion change 
rather than in terms of physical time. 
A unit is defined as the time required 
for all members who are being influ- 
enced to shift their opinions to the 
point of equilibrium of all the forces 
operating at the beginning of that unit. 
At the end of the unit, after this shift 
has taken place, we assume that the 
members now start to argue for their 
new opinions. It should be noted that 
this definition implies that all members 
respond at the same rate to the forces 
impinging on them. One possible op- 
erational definition of a unit might be 
a single trial in an experiment such as 
Sherif’s. 

This conception of influence as a proc- 
ess over time implies a distinction be- 
tween direct and indirect influence. In 
a typical organization the president usu- 
ally influences indirectly a person at 
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the bottom of the chain of command 
through orders which are handed down 
through several subordinates. Direct 
influence is exerted on another person 
by direct communication which is not 
channelled through a third person. Jn- 
direct influence is exerted on another 
through the medium of one or more 
other persons. Therefore the direct in- 
fluence of A on B always occurs during 
the same unit, whereas indirect influ- 
ence requires two or more units. For 
example, A influences B directly during 
the first unit, and B influences C toward 
his new opinion during the second unit. 
Thus A has indirectly influenced C by 
transmitting his opinion via B. In this 
model, the power structure and the com- 
munication channels of the group are 
translated into a process of influence 
over time. In the first unit any mem- 
ber, A, influences only those recipients 
of his communication over whom he has 
direct power; in the second unit A’s in- 
fluence is also transmitted to all those 
over whom these intermediaries have 
power; in the third unit A’s influence 
is transmitted to those who are three 
steps removed from him in the power 
structure, etc. 


THE PostTuLATES 


Three main postulates are involved in 


this model. The first is concerned with 
interpersonal power. The definition of 
power used in this postulate is the same 
as that given by Cartwright (6): the 
power of A over B (with respect to a 
given opinion) is equal to the maximum 
force which A can induce on B minus 
the maximum resisting force which B 
can mobilize in the opposite direction. 

The basis of interpersonal power is 
defined as the more or less enduring re- 
lationship between A and B which gives 
rise to the power. French and Raven 
(in an unpublished manuscript) have 
discussed five bases: attraction power 
based on B’s liking for A, expert power 
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based on B’s perception that A has su- 
perior knowledge and information, re- 
ward power based on A’s ability to 
mediate rewards for B, coercive power 
based on A’s ability to mediate punish- 
ments for B, and legitimate power based 
on B’s belief that A has a right to pre- 
scribe his behavior or opinions. Any 
basis of power can vary in strength: 
there may be variations in how much 
B likes A, in how much B respects A’s 
expertness, etc. Postulate 1 is general 
enough to refer to all bases of social 
power. 

Postulate 1. For any given discrep- 
ancy of opinion between A and B, the 
strength of the resultant force which an 
inducer A can exert on an inducee B, 
in the direction of agreeing with A’s 
opinion, is proportional to the strength 
of the bases of power of A over B. 

Attraction as a basis for interpersonal 
influence has been demonstrated in ex- 
periments by Back (2) and by French 


and Snyder (13), and in field studies 
by Lippitt, Polansky, and Rosen (25). 
Expertness as a basis for interpersonal 
power has been demonstrated in the 
latter two studies as well as in many 


others (19, 27, 28). In unpublished 
experiments French and Raven and 
French, Levinger, and Morrison have 
demonstrated that legitimacy and the 
ability to punish are bases for social 
power. Heider (18) and Newcomb 
(29) state their theories in terms of 
“positive relations,” a more general con- 
ception which combines several types of 
power. In most real groups the power 
relations probably do combine several of 
the bases discussed here and others too. 
Postulate 1 refers to all of these bases 
combined. 

Resistance, as a part of the social 
power discussed in Postulate 1, has not 
been treated separately nor in detail in 
this model. In a further development it 
might be coordinated to such factors as 
“certainty of own opinion” (13), or as 
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Kelman (21) and Mausner (26) call 
it, “prior reinforcement,” and to vari- 
ous personality characteristics such as 
rigidity and authoritarianism (8). 

Postulate 2. The strength of the 
force which an inducer A exerts on an 
inducee B, in the direction of agreeing 
with A’s opinion, is proportional to the 
size of the discrepancy between their 
opinions. 

This postulate combines two effects 
which have been demonstrated in previ- 
ous research. (a) More influence is at- 
tempted toward the member who is more 
discrepant (10, 11, 30). These studies 
also show, however, that this effect 
holds only under conditions where the 
inducee is not rejected. Too great a 
deviation leads to changes in the at- 
traction power structure of the group 
and hence to changes in the effects im- 
plied by Postulate 1. (0) If the amount 
of influence attempted is held constant, 
the amount of change in the inducee in- 
creases with increasing size of discrep- 
ancy. For this latter relation, French 
and Gyr (12) report correlations of .77, 
.62, .65, and .83 in different experimen- 
tal groups. Goldberg (14) also reports 
a strong tendency for the amount of 
change to increase with increasing dis- 
crepancy, with the inducee moving 30 
per cent of the way toward the inducer 
for discrepancies of all sizes. In a sub- 
sequent unpublished theoretical paper,* 
Goldberg also assumes that change in 
opinion is a direct function of discrep- 
ancy until the inducee rejects the credi- 
bility (expert power) of the inducer, 
after which it becomes an inverse func- 
tion of discrepancy. Again the data 
support Postulate 2 within the range 
where the expert power structure of the 
group is not changed. 

Postulate 2 is represented in Fig. 1 by 
the two increasing gradients of forces 

*#S. C. Goldberg, Some cognitive aspects of 
social influence: a hypothesis. (Mimeo- 
graphed.) 
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around A’s opinion and around B’s opin- 
ion. The two gradients are assumed to 
be linear, though the evidence cited 
above would suggest that they are curvi- 
linear. We have made the more con- 
venient assumption because it appears 
to be true as a first approximation and 
because it seems to be possible to revise 
the postulate later, if subsequent em- 
pirical data do show curvilinearity, with 
only minor quantitative changes in the 
theorems. 

Postulate 3. In one unit, each per- 
son who is being influenced will change 
his opinion until he reaches the equi- 
librium point where the resultant force 
(of the forces induced by other mem- 
bers at the beginning of the unit and 
the resisting force corresponding to his 
own resistance to change) is equal to 
zero. 

Postulate 3 is an application of a 
basic assumption of Lewin (22) that 
locomotion or restructuring will take 


place in the direction of the resultant 
force whenever that force is greater 


than zero. Though consistent with a 
great many empirical studies, this as- 
sumption is close to a conceptual defi- 
nition which cannot be directly tested. 


THEOREMS 


For lack of space, no attempt will be 
made to state all the theorems which 
have been proven nor to give the for- 
mal proofs of those presented. Instead 
we will select some representative theo- 
rems and indicate informally the nature 
of the derivations. In making empirical 
predictions from these theorems, this 
theory, like any other, must always as- 
sume “other things being equal,” in- 
cluding all extrasystem influences and 
the many factors within the group 
which are not part of the theory. 


The Effects of the Power Structure of 
the Group 


This section presents some theorems 
concerning the effects of the power 
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structure of the group on the influence 
process and its outcome. These theo- 
rems illustrate how the present theory 
explains a well known proposition about 
groups in terms of concepts about inter- 
personal relations. 

This proposition—that the strength 
of group standards increases with in- 
creasing cohesiveness of the group—has 
been substantiated in several studies (2, 
4, 9, 30, 31). A group standard has 
been defined conceptually as group-in- 
duced pressures toward uniformity of 
behavior or belief, and it may be meas- 
ured by the degree of conformity of 
members produced by these pressures. 
Cohesiveness has been defined concep- 
tually as the resultant forces on mem- 
bers to belong to the group, but it has 
been operationalized in many of these 
experiments as the attraction of mem- 
bers for one another (7, 24). Festinger, 
Schachter, and Back (9) have shown 
that the hypothesized relation is stronger 
when cohesiveness is operationalized in 
a way which takes account of the pat- 
tern of the sociometric structure instead 
of a simple summation of choices. But 
each sociometric choice measuring the 
attraction of member B toward member 
A is, according to Postulate 1, a basis 
for A’s power over B. Thus the socio- 
gram of a group can be transformed 
into the attraction power structure of 
the group by simply reversing the direc- 
tion of each arrow. The attraction 
power structure of the group is a spe- 
cial type of power structure, and hence 
it is treated in these theorems about 
power structure and trends toward uni- 
formity of opinion within the group. 

The power structure of a group may 
be represented conceptually in terms of 
the mathematical theory of directed 
graphs, called “digraphs.” A digraph 
is a finite set of points A, B, C,... 
and a subset of the directed lines 
AB, BA, AC, CA, BC, CB, .. . be- 
tween distinct points. In representing 
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power structures as digraphs, we shall 
coordinate points to members and di- 
rected lines to power relations between 
members. In this coordination we shall 
make only relatively crude distinctions 
in differences of power: if “A has power 
over B,” there is a directed line AB 
in the digraph representing the power 
structure of the group; if “A does not 
have power over B,” there is no such 
line. 

Various properties of digraphs may 
be used to characterize power struc- 
tures of groups. We shall be concerned 
here primarily with the “degree of con- 
nectedness” of power structures. In or- 
der to discuss this property we need 
two definitions: complete digraph and 
directed path. A digraph is complete if 
there exists a directed line from each 
point to every other point. A power 
structure would be complete, then, if 
each member had power over each other 
member. If we assume that when A 
chooses B sociometrically B has power 
over A, then it follows that when every 
member of a group chooses every other 
member, the digraph representing the 
power structure of the group will be 
complete (e.g., No. 5 in Fig. 2). A di- 
rected path is a collection of distinct 
points A,B,C, . . . , together with the 
lines AB, BC,... If in the power 
structure of a group there is a directed 
path from A to C, it follows that A can 
exert influence on C even though A may 
not have direct power over C (there 
must be a sequence of directed lines 
originating at A and going to C even 
though there is no line AC). 

In their work on digraphs Harary and 
Norman (17) have defined four degrees 
of connectedness. Their definitions are 
as follows: (a) A digraph is strongly 
connected (or strong) if for every pair 
of distinct points, A and B, there exists 
a directed path from A to B and a di- 
rected path from 8 to A. It follows 
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that every complete digraph is strong, 
but not every strong digraph is com- 
plete. (6) A digraph is unilaterally 
connected (or unilateral) if for every 
pair of points, A and B, there is a di- 
rected path from A to B or from B to 
A. (c) A digraph is weakly connected . 
(or weak) if it is impossible to separate 
the points of the digraph into two 
classes such that no line of the digraph 
has one end point in one class and the 
other end point in the other class. 
Thus, for every possible separation of 
all of the points of a weak digraph into 
two disjoint, nonempty classes, there 
must be at least one line having one 
end point in one class and the other 
end point in the other class. (d) A 
digraph is disconnected if it is not weak. 
Thus a disconnected digraph may be 
separated into two (or more) disjoint 
classes of points such that no line goes 
from one class to the other. From these 
definitions it is clear that all strong 
digraphs are unilaterally and weakly 
connected and that all unilateral di- 
graphs are weakly connected. It is also 
clear that all weak digraphs are not 
strongly connected. For this reason it 
is useful to define a digraph as strictly 
unilateral if it is unilateral but not 
strong, and to define a digraph as 
strictly weak if it is weak but not uni- 
lateral. In our discussion here, when 
we speak of unilateral or weak digraphs 
we shall mean “strictly unilateral” and 
“strictly weak.” 

In groups where each member com- 
municates to all others over whom he 
has direct power during every unit of 
the influence process, the amount of 
uniformity achieved and the speed of 
achieving it tend to vary with the de- 
grees of connectedness of the power 
structure, except that no differences 
were proved for weak vs. disconnected 
digraphs. 

The effect is illustrated in Fig. 2, and 
generalized later in the first four theo- 
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rems. In Fig. 2 five different types of 
structures (complete, strong, unilateral, 
weak, and disconnected) are illustrated 
by digraphs of four-person groups. To 
the right of each structure are curves 
showing some of the theoretically pre- 
dicted changes of opinion. The ordi- 
nate gives the scale of opinion and, at 
the left, the initial opinion of members 


UNIT I 











UNIT 2 




















The effects of connectedness on opinion changes in the group. 


A, B, C, and D. The line labeled 
“Unit 1” gives the distribution of opin- 
ion after the first unit; the line labeled 
“Unit 2” gives the distribution of opin- 
ion after the second unit; and the line 
labeled “Unit «” gives the equilibrium 
of opinions reached in an infinite num- 
ber of units. 

The disconnected structure is com- 
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posed of two cliques, AD and BC. Be- 
tween these two cliques there are no 
paths of influence, regardless of direc- 
tion. Consequently neither clique can 
influence the other, and each will even- 
tually end up with a different opinion. 
Within the AD clique, influence is all in 
one direction, so A will eventually swing 
D over to his opinion. Since D only 
moves half-way in each unit, however, 
he will require an infinite number of 
units to move all the way. Accordingly 
the dotted lines show D’s opinion con- 
verging to A’s at infinity while A’s opin- 
ion remains unchanged. Within the BC 
clique, influence is mutual, so in the first 
unit B will influence C to move half- 
way from 5 to 2 on the opinion scale, 
and likewise C will influence B to move 
half-way from 2 to 5. Therefore both 
B and C will arrive at 34 on the opin- 
ion scale in the first unit and will re- 
main in agreement thereafter. 

The weakly connected structure is 
more highly connected but still does not 
result in unanimous agreement. In this 
case there is no directed path, for ex- 
ample, between A and D, so neither can 
influence the other. 

The unilaterally connected structure 
has a directed path in at least one direc- 
tion between every possible pair of 
points. Because it has a higher degree 
of connectedness, it shows more con- 
vergence of opinion. 

The strongly connected structure has 
directed paths in both directions be- 
tween every possible pair of points. In 
this example the strongly connected di- 
graph is a cycle, yielding a final com- 
mon opinion which reflects more equal 
influence of all members. 

The completely connected structure 
has direct, one-step paths in both di- 
rections between the members of every 
possible pair. It converges in only one 


unit to a final common opinion. 
Theorem 1. For all possible patterns 
of initial opinion, in a completely con- 
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nected power structure the opinions of 
all members will reach a common equi- 
librium level equal to the arithmetic 
mean of the initial opinions of all the 
members, and this final common opin- 
ion will be reached in one unit. 

Under these conditions where the 
power and the resistance of all members 
is equal, we have already illustrated in 
Fig. 1 that the new opinion of A is 
equal to $(a + 5), ie., the arithmetic 
mean of the opinions of both members. 
B’s opinion at the end of the first unit 
5; is also equal to 4(a + 5), according 
to Postulates 2 and 3. Thus this two- 
person group reaches agreement in one 
step. The proof of Theorem 1 for an 
N-person group is a simple extension of 
this example. 

Theorem 2. In an N-person cycle 
(which is a strongly connected group) 
the members will reach a final common 
opinion at the arithmetic mean, (1/1) 
(a+b+c+.. .), in an infinite num- 
ber of units. 

If A has power over B, then 5; = 4(a 
+6) and de, the opinion of B at the 
end of the second unit, = $(a, + 5;). 
In general, B’s opinion at the end of any 
unit will be half-way between his own 
and A’s opinion at the beginning of the 
unit; so the general difference equation 
describing B’s change of opinion in any 
unit, m, is: db, = 4(da+ + 0,1). Solv- 
ing these general difference equations 
for all members constitutes a proof of 
Theorem 2. 

Theorem 3. In a unilaterally con- 
nected group the opinions of all mem- 
bers will converge to a final common 
opinion in an infinite number of steps. 

It is an obvious theorem of digraph 
theory that no strictly unilateral di- 
graph can have more than one point of 
input zero, i.e., with no directed lines 
leading to it (because then these two 
or more points could not have a di- 
rected path between them—which vio- 
lates the definition of a unilateral di- 
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graph). It follows that, during every 
unit, at least one of the two members 
at the extremes of the range of opinion 
will be subject to the power of another 
and will move toward the center, thus 
restricting the range of opinion still 
further. Eventually, therefore, all mem- 
bers will arrive at the same opinion. If 
there is one person in the group with in- 
put zero, then all members will eventu- 
ally agree with his initial opinion, for 
he will influence the others but no one 
will influence him. 

Theorem 4. In a weakly connected 
group the members will not reach com- 
mon agreement except under special 
conditions in the distribution of initial 
opinions. 

A (strictly) weak digraph contains at 
least one pair of points with no directed 
path between them. Thus there are at 
least two members who cannot influence 
each other either directly or indirectly. 

The disconnected group. When the 
final equilibrium has been reached, a 
disconnected group will tend to have 


at least as many different opinions as 
there are cliques (i.e., disjoint classes of 
members), because no clique can influ- 


ence any other. If all the cliques are 
themselves either completely connected, 
strongly connected, or unilaterally con- 
nected, it follows from Theorems 1, 2, 
and 3 that there will be uniformity of 
final opinions within each clique; but 
there will be differences among them ex- 
cept under special conditions in the dis- 
tribution of initial opinion. 
Summarizing the theorems illustrated 
in Fig. 2, we can say that there is a 
“funnelling effect,” a tendency for the 
opinions of individuals to converge to- 
ward one another, and the strength of 
this tendency increases with increasing 
connectedness in the power structure of 
the group. Since the power structure 
includes the special case of the attrac- 
tion power of the group, we have a more 
general group of theorems consistent 
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with the finding that the strength of 
group standards is determined by the 
cohesiveness of the group. Addition- 
ally the model predicts the exact level 
of the group standard as well as the 
precise degree of conformity at each 
unit. Thus we have rigorously derived 
a more differentiated statement of the 
empirically well-established relation be- 
tween cohesiveness and group standards. 

So far we have considered only all-or- 
none variations in the power of A over 
B; now we will illustrate the effect of 
continuous variation. 

Theorem 5. The greater the bases of - 
power of A over B (B’s attraction to A, 
B’s acceptance of A as an expert, etc.), 
the more influence A will have on B 
and subsequently on any other person 
P for whom there exists a directed path 
from B to P. 

According to Postulates 1 and 3, in- 
creases in the basis of power of A over 
B will increase the strength of the re- 
sultant force exerted by A on B and 
therefore the amount of change pro- 
duced in B. Similarly in subsequent 
units this influence will be transmitted, 
though in a weakened form, from B to 
P. 


The Effects of Communication Patterns 


In the preceding section we have 
dealt with the restricted case“of groups 
of persons whose power is always uti- 
lized in every unit.’ Earlier we noted 
that the head of an organization may 
not communicate to all those over whom 
he has direct power but will instead 
follow the established channels of com- 
munication. Likewise in a face-to-face 
group a member may remain silent or 
may attempt to influence some but not 

5It is probable that B will respond partly 
to the relationships among successive influence 
attempts by A, for example to the consistency 
among his various arguments or to the simple 
fact of too much reiteration of the same influ- 
ence attempt; these factors are omitted from 
the present model. 
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others over whom he has power. These 
patterns of interaction often become 
stabilized so that they may be treated 
as more or less consistent channels of 
communication. It is also clear that 
the strength of influence attempted can 
vary continuously, but we shall here 
treat the communication from A to B 
as an all-or-none variable so that we can 
utilize digraph theory. 

Now if we reverse the conditions of 
Theorems 1 through 4 and consider 
only completely connected power struc- 
tures with variations in the degree of 
connectedness of the communication 
channels, we can apply the same four 
theorems and proofs. For example: 

Theorem 1a. For all possible pat- 
terns of initial opinion, in a completely 
connected communication network, the 
opinions of all members will converge 
to a common equilibrium level equal to 
the arithmetic mean of the initial opin- 
ions of all the members; and this final 
common opinion will be reached in one 
unit. 


Similarly, theorems analogous to 2, 
3, and 4 can be stated for strong, uni- 
lateral, and weak communication net- 


works, respectively. All possible net- 
works in experiments of the Bavelas 
type (3) are included in these theorems. 

Even where stable communication 
channels do not exist, this model may 
be applied provided the interaction pat- 
tern is specified for each unit. Con- 
sider a strongly connected cycle of three 
persons. Theorem 2 states that opin- 
ions in this group will converge to a 
final common opinion equal to 4(a + 5 
+c). In Theorem 6 we assume a par- 
ticular communication pattern: A exerts 
influence in the first unit, B and C exert 
influence in the second unit, A exerts 
influence in the third unit, B and C in 
the fourth unit, and so on. 

Theorem 6. In a group where the 
power structure is a three-person cycle 
in which A has power over B, B has 
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power over C, and C has power over A, 
and the communication pattern is 4A, 
BC, A, BC,..., the final common 
opinion in the group equals 4(2a + b 
+ 2c). 

We note that a change in the inter- 
action pattern changes the outcome con- 
siderably. Furthermore B and C no 
longer have equal influence, even though 
they have equal interaction patterns and 
similar positions in the power structure; 
it is the interaction of these two factors 
which produces the difference. A has 
more influence than B because he comes 
first in the sequence of interaction, but 
C has more influence than B because he 
has direct power over A whereas B’s 
power over A is indirect. Intuitively it 
would appear that the “primacy effect” 
shown in this theorem can be general- 
ized: the sooner a person speaks the 
more influence he will have. 


The Effects of Patterns of Opinion 


In an experiment like Sherif’s, each 
member communicates to every other 
and the members probably have rela- 
tively equal power. In such a com- 
pletely connected power structure with 
completely connected communication 
channels, what happens to the opinion 
of a single deviate member? 

Theorem 7. The amount of change 
of the deviate toward the opinions of. 
the majority is proportional to the sum 
of the deviations of all other members 
from the deviate. 

By Theorem 1 the amount of change 
by the deviate D equals d; — d which 
is equal to 1/N(d+a+b+c+...) 
—d. Thus the more members in the 
group the more they will influence the 
deviate. Also the larger each deviation, 
the more D will change. Though these 
predictions are generally congruent with 
Asch’s findings, they probably do not 
agree in detail (1, 15). However, the 
conditions of Asch’s experiment do not 
fit the model very well. 
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Leadership 


To a large extent leadership consists 
of a member’s ability to influence others 
both directly and indirectly by virtue of 
his position in the power structure, in- 
cluding the structure of legitimate au- 
thority. Thus leadership may be dis- 
tributed among many members or con- 
centrated in a few; the pattern of lead- 
ership is a distribution which describes 
the whole group rather than an attribute 
of single individuals. Figure 2 illus- 
trates the dependence of influence on 
the total structure of the group. 

Compare the influence of member A 
in the weakly connected group with the 
influence of member A in the unilater- 
ally connected group. Both groups 
start out with the same distribution of 
opinion, and in both groups A has di- 
rect influence over only B. However, 
A’s influence is markedly different in 
the two cases; in the weakly connected 
group the opinions of others diverge 
more and more from his, whereas in the 
unilaterally connected group the opin- 
ion of all other members converges com- 
pletely to A’s opinion. 

The complete distribution of direct 
plus indirect leadership in a group with 
any power structure and any communi- 
cation network may be calculated by 
matrix multiplication. We may repre- 
sent the power structure of the group as 
a matrix where each row shows the 
power applied to a member and each 
column shows the power exerted by a 
member. A zero in the cell correspond- 
ing to the ath row and the dJth column 
shows that B does not have power over 
A, whereas a one in the cell correspond- 
ing to the cth row and the dth column 
shows that D does have power over C. 
Thus the number in a cell represents 
the number of directed lines from the 


6 See Harary and Norman (16) for a brief 
review of some related applications of matrix 
algebra to sociometric data. 
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person in that column to the person in 
that row (under the conditions assumed 
in this paper, always one or zero). If 
this matrix M is multiplied by itself, 
then the resulting squared matrix M? 
shows in each cell the number of se- 
quences consisting of two directed lines 
betwen the person in the column and 
the person in the row. The cubed 
matrix M* shows the number of three- 
line sequences between each pair of per- 
sons. By raising the matrix to succes- 
sively higher powers, we can thus de- 
termine the number of directed line 
sequences, of various lengths, from each 
member to every other. The matrix M 
gives the directed lines which will re- 
sult in influence in the first unit; M? 
gives the two-line sequences through 
which influence will be exerted by the 
end of the second unit; M®* gives the 
three-line sequences through which in- 
fluence will be exerted by the end of 
the third unit; etc. 

In order to apply this process to 
Group G in Fig. 3, we construct a 
matrix of opinion M where the columns 
a, b, c, d represent influence exerted by 
the initial opinions a, 5, c, d of persons 
A, B, C, D, respectively. The rows 
represent the influence received by these 
opinions from all the opinions in the 
group. Thus the cell entries must show 
the amount by which an opinion is 
changed by another opinion during one 
unit; and these values are given by the 
coefficients in the right hand side of the 
general difference equations. For Group 
G these equations are: 


(1) 
(2) 


An = An-1, 

bn = $(@n-1 + 5n-1), 
Cn = $(n—-1 + Cas), (3) 
d, = §(bn-1 + Con t+ dy—1)- (4) 


The cell a, a has an entry of 1, indicat- 
ing that in any unit A’s opinion is com- 
pletely determined by his previous opin- 
ion; accordingly the remaining cells in 
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row a have entries of zero, showing that 
opinions 5, c, d do not influence a, since 
there are no directed paths from B, C, 
or Dto A. Cells b, a, and b, b have 
entries of 4 because 5, is a compromise 
half-way between the previous opinions 
of A and B, etc. 

It will be noted that each row in M 
(and in M? and M®*) sums to 1 because 
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The distribution of leadership in a weakly connected group. 


it represents the total opinion of a mem- 
ber, and the fractions along the row rep- 
resent the proportion of that opinion de- 
termined by each person. The sum of 
a column in M, on the other hand, rep- 
resents the total influence of a person’s 
opinion during the first unit on the opin- 
ions of all members (including the in- 
fluence of his initial opinion on his sec- 
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ond opinion—which we have called “re- 
sistance”). 

In M? the column sum shows the total 
influence of a person’s initial opinion at 
the end of the second unit (including 
the changes produced in both the first 
and second units). Similarly M@* shows 
the cumulative influence at the end of 
the third unit. The same procedure can 
obviously be extended to any number of 
units. 

Thus the column totals of the succes- 
sive powers of M give the distribution 
of leadership over time, as predicted by 
this theory. In Group G, we can see 
that A, the only member with input 
zero in the power structure, continuously 
increases his influence at the expense of 
the other members. B and C, having 
symmetrical positions in the structure, 
show the same curves of decreasing in- 
fluence; but D, who is influenced by all 
other members, has the least influence. 


CONCLUSIONS 


This theory illustrates a way by which 
many complex phenomena about groups 
can be deduced from a few simple pos- 
tulates about interpersonal relations. By 
the application of digraph theory we are 
able to treat in detail the patterns of 
relations whose importance has long 
been noted by the field theorists. Even 
if this treatment does not turn out to 
be empirically correct, it illustrates the 
need for some such conceptual and 
mathematical tools if we are to make 
progress toward the theoretical integra- 
tion of psychology and sociology. 
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DRIVE, INCENTIVE, AND REINFORCEMENT * 


JOHN P. SEWARD 
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Why does one generally do what leads 
to pleasure and avoid doing what leads 
to pain? To many persons this is like 
asking: What color is that white cat? 
A few, however, may reply: Is that 
really what one does? It is to these 
doubters—psychologists, if you will— 
that the present article is addressed. 

Let us start by granting much of the 
hedonistic premise: human beings do 
tend, on the whole, to choose activities 
with pleasant consequences and to aban- 
don those with painful ones. The sub- 
jective terminology need not deter us. 
Thorndike (45) has already told us how 
to define “satisfiers” and “annoyers” in 
behavioral terms. His original law of 
effect—that the connection between a 
situation and a response is strengthened 
by satisfaction and weakened by dis- 
comfort—could be stated quite objec- 
tively. Moreover, if by “connection” 
he had meant simply the probability 
that response R would occur in situa- 
tion S, he need never have discarded 
one-half of the original law. So inter- 
preted, the law of effect becomes a be- 
havioral statement of hedonism. As 
such it can be extended to a wide range 
of species. 

But having conceded this much we 
find ourselves not far ahead. To put 
the law to use we must know what con- 
ditions make for satisfaction and dis- 
comfort and how behavior varies with 
these and other conditions. In a word, 
we must know ow rewards and punish- 
ments exert their influence. Only by de- 


1The gist of this paper was presented in a 
panel discussion: “How can behavior theory 
best handle the construct of motivation?”, 
Section I, American Association for the Ad- 
vancement of Science, Berkeley, Calif., De- 
cember 28, 1954. 


riving the law itself can we understand 
apparent exceptions to it. Hedonism in 
the guise of the “empirical law of ef- 
fect” (10) may “explain” the obedient 
child; more basic principles are needed 
to explain the “problem child” as well. 

We face anew, therefore, the question 
of how rewarded responses are selected, 
or reinforced, and punished responses 
eliminated. (Reinforcement is here 
used in a strictly empirical sense to re- 
fer to the increase in probability of a re- 
sponse accompanied by a reward.) I 
propose to discuss two theories of rein- 
forcement in the light of experimental 
evidence, to show that these theories 
lead to a dilemma, and to suggest a 
solution. 


Two THEORIES OF REINFORCEMENT 


Drive-Reduction Theory 


The first theory is that of drive re- 
duction, most closely identified with 
Hull. (By drive I mean an excitatory 
state produced by a homeostatic dis- 
turbance.) The primary reinforcing 
agent, for Hull, was “the diminution in 
the receptor discharge characteristic of 
a need” (20, p. 80); ? when it occurred 
near the conjunction of a stimulus and 
a response it produced an increment of 
habit strength between them. Hiull’s 
associates disagree in their use of mo- 
tivational constructs. Brown (5) and 
Farber (14) assign the reinforcing func- 
tion to reduction of a general energiz- 


2 In his final volume Hull changed this defi- 
nition to “the rapid diminution in the motiva- 
tional stimulus (Sp or se)” (2, p. 6). By in- 
cluding so Hull revealed a significant trend in 
his thinking which he never made fully ex- 
plicit; had he done so he might have antici- 
pated much of the present discussion. 
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ing component, drive, restricting the 
drive stimulus to an associative and dis- 
criminatory role. Dollard and Miller 
(13), with Hull, give the drive stimulus 
a twofold function as cue and rein- 
forcer. But in any case rewards are 
held to select responses through the 
learning that results from diminution in 
an excitatory state. 

The scope of drive-reduction theory is 
vastly enlarged by the assumption that 
both drives and rewards can be learned. 
This assumption provides two possible 
mechanisms of secondary reinforcement: 

1. The first, proposed by Mowrer 
(28) to account for avoidance learning, 
involves the reduction of a learned 
drive. If a stimulus has been condi- 
tioned to a drive-producing response, 
removal of that stimulus should serve 
as a reinforcer. Thus a buzzer that has 
preceded a shock will arouse fear; turn- 
ing off the buzzer when a certain re- 
sponse is made should then increase the 
probability of that response. This type 
of adjustment has been repeatedly veri- 
fied (26). 

2. The other mechanism of secondary 
reinforcement is the one postulated by 
Hull (20). In comparison with the 
first it may be characterized as the 
learned reduction of a drive. If a 
. stimulus is conditioned to the response 
of relaxation that occurs when a need 
is gratified, it will arouse some part of 
that response in the absence of the 
original reward. Since learning often 
occurs in situations one or more steps 
removed from primary need gratifica- 
tion, the value of such an explanatory 
device is obvious. It is worth noting 
that this mechanism, contrary to the 
first, has been most frequently verified 
in connection with appetites rather than 
aversions (26). 

Although—or perhaps because—con- 
stantly bombarded by opposing views, 
drive-reduction theory has displayed re- 
markable resilience, surviving many an 
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“ordeal by experiment.” In the last 
few years a new rival has arisen to chal- 
lenge its validity. 


Incentive Theory 


Sheffield’s (39, 41) theory emphasizes 
the role of the consummatory response 
in reinforcement. Developed in close 
correlation with experimental evidence, 
his view turns out to have much in 
common with my concept of tertiary, or 
goal-produced, motivation (37). Briefly 
his argument may be illustrated as fol- 
lows: 

A hungry animal in a runway enters 
a goalbox and eats food. On later trials 
the eating response is generalized to the 
runway and most vigorously activated 
when the animal approaches the goal. 
Since eating cannot be completed with- 
out food, however, its partial blocking 
generates a kind of frustration drive, 
which energizes whatever response is in 
progress. The more vigorous the con- 
summatory response, the stronger will 
be the drive produced by its partial 
arousal. It follows that the response 
sequence leading most directly to the 
goal will induce the most intense drive 
and receive in turn the most potent ex- 
citation. For this reason Sheffield sug- 
gested that his account be called the 
drive-induction theory of rewards to 
contrast it with the older view. To 
avoid confusing two sources of drive, 
however, I prefer to call his an incen- 
tive theory. 


EXPERIMENTAL EVIDENCE 


Space limits forbid a detailed review 


of relevant experiments. In this sec- 
tion we are mainly concerned with the 
logic of inquiry and the interpretation 
of results. 

The responses involved in satisfying 
a bodily need may be arbitrarily marked 
off in three overlapping but roughly se- 
quential stages: (a) instrumental and 
preparatory responses (R;), highly vari- 
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able and largely learned; (5) consum- 
matory or goal responses (Rg)—e.g., 
chewing, swallowing, copulating—rela- 
tively uniform within a species; (c) 
homeostatic processes (G)—e.g., diges- 
tion, ejaculation—by which the dis- 
turbed balance is temporarily restored. 
The two theories under discussion were 
advanced to explain the selection of 
Rj: one does so by invoking G; the 
other, Rg. To decide between them 
means must be devised to break up the 
natural sequence so as to separate 
stages (b) and (c). 

Efforts to this end have taken two di- 
rections. One aims to preserve Rg but 
prevent G; e.g., by sham feeding. If 
under these conditions R; increases in 
strength or resists extinction, we may 
infer that drive reduction is not neces- 
sary to reinforcement. If, on the other 
hand, R; remains weak or extinguishes, 
it follows that Rg is not a sufficient re- 
inforcing condition. The second method 
is to bypass Rg and induce G directly; 
e.g., by stomach fistula. By strictly 
analogous reasoning, if this method re- 
inforces R;, we conclude that Rg is not 
a necessary condition; if it does not, 
drive reduction is shown to be insuffi- 
cient. 


Goal Response Without Drive Reduc- 
tion 


Here we shall consider only sham- 
feeding experiments using saccharine,® 
a nonnutritive substance relished by 


rats. Its effectiveness as a reward was 
demonstrated by Sheffield and Roby’s 
well-known experiment (40), and con- 
firmed by Cockrell (11). Our question 
is whether to attribute these results to 
Rg, as incentive theory requires, or to 
vicarious drive reduction. 

Sham feeding may well induce some 


3 Exploratory experiments by Hull et al. 
(22) on one dog with an esophageal fistula 
gave promise that pursuit of this method 
should yield rich returns. 
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of the visceral responses, with accom- 
panying relaxation, characteristic of real 
feeding. Carlson (8) reported that, in 
his subject with a gastric fistula, tastes, 
chewing movements, and swallowing in- 
hibited stomach contractions and the 
accompanying hunger pangs. (He noted 
that such stimulation frequently aug- 
ments appetite.) The inhibiting effect 
may be either reflex or learned; if 
learned it coincides with the mechanism 
of secondary reinforcement suggested by 
Hull. Carlson’s observations on dogs 
and men, however, were not true of 
rabbits and guinea pigs, more closely 
related to rats (8, p. 168). 

The argument for secondary rein- 
forcement implies several consequences: 

1. By stimulus generalization the re- 
ward value of saccharine or any other 
nonnutritive substance should depend 
on how closely its flavor resembled that 
of the animal’s normal diet. Where 
saccharine would fall on such a gradi- 
ent is unknown.* 

2. The effect should eventually ex- 
tinguish through differential reinforce- 
ment. Sheffield and Roby found no 
such trend in three experiments cover- 
ing 42 days of training, though this may 
not have been long enough. 

3. Saccharine should be less effec- 
tive, in the long run, than sugars. Re- 
sults conflict, Carper (9) finding the 
expected difference, and Thomson (42) 
finding none. From the standpoint of 
incentive theory, of course, such com- 
parisons are meaningless until stimulus 
concentrations are established for which 
nonhungry rats show no taste prefer- 
ence. 

4. If pretraining is given to associate 
a sweet taste with need reduction, its 


4 Measuring nerve impulses from the rat’s 
tongue, Beidler found that responses to sac- 
charine resembled those to salts and acids 
rather than sugars; moreover, “a single fiber 
responding to salt stimulation also responds 
to saccharin but not to glucose or sucrose” 
(2, p. 598). 
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reward value should be enhanced.. One 
test of this prediction (41) gave nega- 
tive results. 

In summary, it appears that Rq’s can 
reinforce behavior even when they fail 
to satisfy the relevant need. Whether 
this ability is inherent or merely bor- 
rowed as learned drive reduction can- 
not yet be decided.® 


Drive Reduction Without Goal Re- 
Sponse 


Escape and avoidance learning. To 
test the incentive hypothesis—that Rg 
can reinforce R, in the absence of G— 
it was necessary to select an appetite, 
such as hunger or sex, with a charac- 
teristic Rg and artificially prevent its 
normal consequence. Drive-reduction 
theory, on the other hand, can be 
most readily verified by using noxious 
external stimuli, since the drives they 
produce (pain, fear, annoyance) can 


be manipulated independently of innate 


Rq’s. Organisms are equipped, it is 
true, with flexion, startle, eyelid, and 
other defensive reflexes. But no one of 
these can be relied on to eliminate a 
disturbance (e.g., a buzzing fly), and 
other responses are typically brought 
into play. A rat shocked on a charged 
grid has a variety of responses avail- 
able and he will learn to make which- 
ever one consistently lessens or turns 
off the shock or shock signal, whether 
this is to sit or press a pedal (29), to 
jump in the air, or run (31). Such 
demonstrations point to a large and im- 
portant class of reinforcers to which in- 
centive theory, as here defined, does 
not apply. 

Other methods. Drive-reduction theo- 
rists, on the other hand, do not con- 
fine themselves to the aversions. While 


5 The experiments of Sheffield, Wulff, and 
Backer (42) and Kagan (23), using “sham 
coitus,” should also be read with this prob- 
lem in mind. 
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Sheffield and his coworkers gathered 
evidence for the importance of Rg, 
Miller and Kessen (27) in the same 
laboratory were reinforcing R; by pour- 
ing food directly into a rat’s stomach 
by fistula. True, mouth-fed rats learned 
much faster. But on the basis of other 
experiments (3, 24) the authors claimed 
that hunger was better satisfied by 
mouth. 

Earlier experiments (18, 34) indi- 
cated that rats deficient in B vitamins 
acquired a preference for foods con- 
taining them, apparently due to bene- 
ficial aftereffects. Thomson and Porter 
(44) recently reported learning in so- 
dium-deprived rats rewarded by a sub- 
liminal salt solution. But probably the 
purest test of a need-reduction theory 
was provided by Coppock and Cham- 
bers (12), who taught hungry rats a 
head movement by rewarding them with 
an intravenous injection of glucose. 

Evidently drive reduction can select 
responses in the absence of any charac- 
teristic goal response or goal stimulus. 


Interpretation 


These experiments leave us “on the 
fence.” There is evidence for both 
consummatory and homeostatic factors 
in reinforcement. The chief unsettled 
question is to what extent Rg owes its 
potency to previous association with G. 
This question, however, leads to an- 
other that may help us to answer the 
first. 

If Rg is merely a secondary reinforcer 
of type (5), its antedating component 
(rq) should also reduce the appropriate 
drive to some extent. But Sheffield’s 
view, we recall, is just the opposite: as 
a frustrated response, rq produces an 
added drive stimulus. Does rg function 
as drive or as partial reward? A final 
answer has not yet been found (38), 
but some evidence for Sheffleld’s view 
is already at hand (1, 15, 16, 33, 39). 
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A Cuoice-Pornt DILEMMA 


Since the two theories are not strictly 
incompatible, it is possible to accept 
both. Indeed the foregoing evidence 
strongly indicates that both drive re- 
duction and incentive play a part in 
reinforcement. But this conclusion re- 
solves one dilemma only to trap us in 
another. 

I have already alluded to this new 
dilemma in another context (36). Con- 
sider rat A at the choice point of a T 
maze. The entire floor of the maze is 
a charged grid except for the right-hand 
goal box. There is little doubt that 
in the course of repeated noncorrection 
trials the average rat will develop a 
strong right-turning habit. Followers 
of both Hull and Guthrie will agree, 
but for different reasons, that the key 
to this outcome is the removal of the 
drive stimulus, shock, in the goal box. 

Now consider rat B at a similar choice 
point but without shock. Instead, the 
rat is hungry and the entire maze is 
empty except for the right-hand goal 
box, which contains a few drops of sac- 
charine solution. Again, as Sheffield 
and Roby have shown, the rat will 
learn to turn right. Suppose we reject 
the tenuous assumption that the taste 
of saccharine has the power, innate or 
acquired, to reduce hunger, but grant 
it the power to evoke drinking. Re- 
sorting to Sheffield’s incentive hypothe- 
sis, we infer that right-turning stimuli 
arouse incipient drinking and that this 
frustrated Rg produces more drive to 
strengthen the ongoing act. 

At this point we confront the di- 
lemma: How can rat A choose the path 
of less drive while rat B chooses the 
path of more drive? If shock and ap- 
petizing “food” are both sources of 
drive to an organism at a choice point, 
how can he learn to escape or avoid one 
and approach the other? I contend 
that these are not exceptional cases but 
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typical of aversions and appetites in 
general. If so we face a problem of 
fundamental significance. 

Brown (6) recently discussed in this 
Journal a related question. How, he 
asked, can drive-reduction theory ac- 
commodate the fact that responses are 
sometimes strengthened in the face of 
increasing stimulation? Though much 
of his paper was devoted to distinguish- 
ing between stimulation and drive, he 
considered the special case in which a 
response is accompanied or followed by 
drive increase without disruption. His 
answer, closely allied to the one to be 
offered below, was that a drive will in- 
tensify an ongoing act if the drive stimu- 
lus is not connected with interfering re- 
sponses. But for a drive-reduction theo- 
rist our choice-point dilemma does not 
arise, since for him reward training does 
not involve a drive increment. Only 


in the light of incentive theory and 
the evidence for it does this problem 


become unavoidable. For this reason 
Brown could not raise the further ques- 
tion: how does his principle—that drive 
stimuli disrupt behavior only if asso- 
ciated with conflicting reactions—enable 
us to predict opposite outcomes for es- 
cape and reward training? 

My own attempts to solve this prob- 
lem, starting five or ten years ago, re- 
flect its stubbornness. In retrospect I 
find I have vacillated between the two 
viewpoints here brought face to face— 
between considering rewards as satisfiers 
and as incentives. A brief review may 
clarify my present position. 

My first reaction to the dilemma was 
to assume that increasing drive, or ten- 
sion, inhibits activity in progress while 
decreasing tension promotes it. On the 
surface, at least, this seems a plausible 
premise: by equating “tension-reducing” 
with “pleasant” and “tension-building” 
with “unpleasant” we can say with the 
hedonists—and neo-hedonists like Mc- 
Clelland (25) and Young (46)—that 
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organisms act in such a manner as to 
maximize pleasure and minimize un- 
pleasantness. 

On this assumption I introduced the 
concept of tertiary motivation (35) to 
suggest that secondary reinforcers act 
by “realizing” a goal set (rsq), thereby 
lowering its tension. Mowrer pointed 
out the error of confusing drive-reduc- 
ing function with motivation. Later 
(37) I corrected it by identifying terti- 
ary drive with the blocking of rsq, but 
still held that drive (rsp) and goal sur- 
rogate were negatively related; release 
of rsq diminished rsp and facilitated 
responses in progress. 

This position now appears untenable. 
Osgood (32, p. 443), discussing the dual 
role of the anticipatory goal reaction as 
both drive and reward, pointed out the 
difficulty; so did Mowrer (30) in his 
most recent theoretical statement. To 
explain delayed reward learning, for ex- 
ample, we should have to argue that 


approach to the goal permits rsq to ap- 
proach completion, prometing the cor- 
rect response by reducing tertiary drive. 
But if this were true, drive would be 
progressively lowered as one neared the 
goal; the closer one came, the less en- 
ergy would be available for approach 


behavior. Unfortunately I can no 
longer shut my eyes to the evidence for 
a goal gradient both in strength of ap- 
proach (4, 19) and in height of tension 
(16, 39). 

The alternative is to start from the 
opposite premise: viz., that rewards 
function as incentives. Once before 
(36) I explored some of the possibili- 
ties of this view. A motive was de- 
scribed in terms of two moments, drive 
(rsp) and goal (rsqg). This much I be- 
lieve was on solid ground. To ac- 
count for the goal direction of behav- 
ior, however, I simply weighted rsq 
more heavily than rsp in a formula for 
motive strength. This step was prob- 
ably abortive. Even if a method could 
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be devised for empirically determining 
the weights, the formula was ad hoc, 
with little rationale beyond the obser- 
vations it was designed to explain. 

Whether this admittedly trial-and- 
error process has helped or hindered, I 
believe the materials are now available 
for a more fruitful approach. 


TowarpD A SOLUTION 


From previous treatments we sal- 
vage two motivational constructs, to be 
classed as mediating or response-pro- 
duced stimuli, one concerned with insti- 
gators, the other with satisfiers. The 
first is the surrogate drive stimulus or 
drive (rsp). It is produced by certain 
disturbers of homeostasis, whether ex- 
ternal (e.g., pain and pain signals) or 
internal (e.g., metabolic deficit); it is 
reduced by removal of the disturbing 
stimulus or by replacement of the de- 
ficit. Drive has two functions, associa- 
tive and nonassociative: as motor it en- 
ergizes responses in general; as cue it 
becomes conditioned to specific re- 
sponses. These two functions corre- 
spond roughly to D and Sp in Hull’s 
system. 

For the second construct the terms 
surrogate goal stimulus, tertiary drive, 
and goal (rs,) are interchangeable. It 
is produced by the strong incipient 
arousal of a response under conditions 
that prevent its completion. Any such 
response qualifies as a goal response 
(R,): one large class undoubtedly con- 
sists of consummatory responses (Ra); 
another and more inclusive class, of re- 
sponses coincident with drive removal. 
In contrast with drives, goals may be 
reduced in two ways: by removing the 
stimulus to R, and by completing R,. 
In common with drives, goals have the 
same two functions. A fractional R, 
produces cues that may be conditioned 
to particular responses; attempts to 
complete it ahead of time generate non- 
specific goal tension. The closest analo- 
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gies in current theory are Brown and 
Farber’s (7) frustration constructs F 
and Sy, although rs, is used in situa- 
tions these authors would hardly call 
frustrating. 

We now ask what are the roles of 
rSp and rs, in response selection? My 
answer is simple and far from original. 
Paraphrasing Guthrie (17), I suggest 
that both drive and goal surrogates, 
along with the external stimuli giving 
rise to them, become conditioned to 
each response made in their presence 
and remain so until conditioned to an 
incompatible response. 

To apply this principle to response 
selection we need an estimate of the 
strength of motivational stimuli that 
will be and stay conditioned to the R, 
in question. On the assumption that 
any such stimulus remaining after R, 
has been made will be unconditioned, 
we may introduce the concept of effec- 
tive stimulus intensity (S), defined as 
the difference between intensity before 
(Sz) and intensity after (S4) Ry. For 
effective drive surrogate we may write: 


TSp = TSpp — TSpa- 


Similarly, effective goal surrogate may 
be written: 


Sg = TSop — TSga- 


Note, however, that in practice 7s, can 
often be more readily computed from 
some measure related to R, itself, such 
as amount of reward. 

Let us now return to the choice point 
and try to predict the behavior of rats 
A and B. To simplify the task we shall 
assume other things equal and evaluate 
only the part played by motivational 
cues. First, how does rat A learn to es- 
cape the charged grid by taking the 
right-hand path? In the absence of a 
definite Rg drive stimuli presumably 
play the dominant role. After a num- 
ber of trials the 75p’s conditioned to the 
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left and right paths will approximate 
the following values: 


Left: TSpB — Spa ms 0, 
Right: rspp — 0 = rsppz. 


Rat A will therefore turn right. 

Secondly, how does rat B learn to 
take the right-hand path to saccharine? 
Hunger-drive stimuli are presumably di- 
minished only slightly, if at all, so they 
contribute little to selective motivation. 
After a number of trials, however, a 
goal surrogate is aroused by the right- 
path stimuli and becomes conditioned 
to an approach response. It remains 
conditioned, moreover, because it is re- 
moved by the actual drinking of sac- 
charine in the goal box. Since no rs, 
is conditioned to left-hand stimuli or 
responses except by generalization, rat 
B will turn right. 

Even for a simplified version the above 
account is oversimplified. It is highly 


probable that in the course of repeated 


escapes from shock rat A will learn some 
response to the goal box that will gen- 
eralize to the choice point as rs,. This 
process should then contribute to total 
reinforcement by the mechanism already 
described for saccharine. Similarly, in 
training to 2 nutritive reward if not to 
saccharine itself, rat B’s hunger should 
act just like rat A’s shock in strength- 
ening the correct response. How drive 
and goal combine and interact is a prob- 
lem for future development. 

This is only a fragment of theory. It 
needs to be fitted into a more compre- 
hensive framework before its implica- 
tions can be tested or even derived. I 
believe this can be done and have taken 
some steps toward modifying Hull’s 
equations in order to accommodate the 
proposed concepts. Enough of the the- 
ory is given, however, to reveal its roots 
and one or two branches. It is founded 
squarely on Guthrie’s doctrine of asso- 
ciation by contiguity, extending his prin- 
ciple to a special class of stimuli, mo- 
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tives, with two subclasses, drives and 
goals.° It contains more than a hint of 
Tolman’s expectancy in its notion of 
goal surrogate, and its hope of elabora- 
tion owes much to Hull. This eclecti- 
cism calls for no apology. Current theo- 
rizing, with its increasing reliance on 
mathematical tools, displays a healthy 
tendency to disregard earlier lines of 
cleavage. 


SUMMARY 


Scratch a motivation theory and you 
find the problem of reinforcement. This 
paper deals with two possible reinforc- 
ers: drive reduction and consummatory 
response. Experiments designed to sepa- 
rate their influence have yielded evi- 
dence in support of both. Drive reduc- 
tion seems to be a sufficient condition 
for response selection. But rewards ap- 
parently do more than reduce drives; 
as incentives they appear to heighten 
excitement as well as activate instru- 
mental responses. This poses a prob- 
lem: How can drive reduction and in- 
centive induction both produce the same 
result, i.e., strengthen a concurrent re- 
sponse? It is suggested that Guthrie’s 
theory, with the aid of two motiva- 
tional constructs, drive and goal, offers 
a solution. 


REFERENCES 


1. Amset, A., & RousseL, Jacqguetive. Mo- 
tivational properties of frustration: I. 
Effect on a running response of the ad- 
dition of frustration to the motivational 
complex. J. exp. Psychol. 1952, 43, 
363-368. 

2. Bemrer, L. M. Properties of chemorecep- 
tors of tongue of rat. J. Neurophysiol. 
1953, 16, 595-607. 

3. Berxun, M. M., Kessen, Marion L., & 
Mutter, N. E. MHunger-reducing ef- 


® Readers acquainted with my earlier efforts 
(see 36) may find this amusing. It was the 
problem of reinforcement that first led me 
away from Guthrie’s theory; the same prob- 
lem, like Maeterlinck’s bluebird, now brings 
me back sadder and wiser (?) to my starting 
point. 





Joun P. SEWARD 


10. 


11. 


13. 


14. 


17. 





fects of food by stomach fistula versus 
food by mouth measured by a consum- 
matory response. J. comp. physiol. 
Psychol., 1952, 45, 550-554. 


. Brown, J. S. Gradients of approach and 


avoidance responses and their relation to 
level of motivation. J. comp. physiol. 
Psychol., 1948, 41, 450-465. 


. Brown, J. S. Problems presented by the 


concept of acquired drives. In Current 
theory and research in motivation: a 
symposium. Lincoln: Univer. of Ne- 
braska Press, 1953. Pp. 1-21. 


. Brown, J. S. Pleasure-seeking behavior 


and the drive-reduction hypothesis. Psy- 
chol. Rev., 1955, 62, 169-179. 


. Brown, J. S., & Farser, I. E. Emotions 


conceptualized as intervening variables— 
wiih suggestions toward a theory of 
frustration. Psychol. Bull. 1951, 48, 
465-495. 


. Cartson, A. J. The control of hunger in 


health and disease. 
Chicago Press, 1916. 


Chicago: Univer. of 


. Carrer, J. W. A comparison of the re- 


inforcing value of a nutritive and a 
non-nutritive substance under conditions 


of specific and general hunger. Amer. 
J. Psychol., 1953, 66, 270-277. 
Carr, H. A. The law of effect. A round 


table discussion. I. Psychol. Rev., 1938, 
45, 191-199. 

Cocxrett, J. T. Operant behavior of 
white rats in relation to the concentra- 
tion of a non-nutritive sweet substance 
used as reinforcement. Unpublished doc- 
tor’s dissertation, Indiana Univer., 1952. 


. Coppock, H. W., & CuHampers, R. M. Re- 


inforcement of position preference by 
automatic intravenous injections of glu- 
cose. J. comp. physiol. Psychol., 1954, 
47, 355-358. 

Doxtarp, J., & Mitter, N. E. Personality 
and psychotherapy. New York: Mc- 
Graw-Hill, 1950. 

Farper, I. E. Anxiety as a drive state. 
In M. R. Jones (Ed.), Nebraska sym- 
posium on moiivation 1954. Lincoln: 
Univer. of Nebraska Press, 1954. Pp. 
1-46. 


. Geter, F.M. The measurement of tension 


in the rat. J. comp. Psychol., 1942, 34, 
43-49. 


. Geter, F. M., & Totaan, E. C. Goal dis- 


I. The goal 
J. comp. 


tance and restless activity. 
gradient of restless activity. 
Psychol., 1943, 35, 197-204. 

Guturi, E.R. The psychology of learn- 
ing. (Rev. Ed.) New York: Harper, 
1952. 





Drive, INCENTIVE, AND 


. Harris, L. J., Cray, J., Harcreaves, F. J., 
& Warp, A. Appetite and choice of diet. 
The ability of the vitamin B deficient 
rat to discriminate between diets con- 
taining and lacking the vitamin. Proc. 
roy. Soc. Lond., Ser. B, 1933, 113, 161- 
190. 

. Hurt, C. L. The rat’s speed-of-locomo- 
tion gradient in the approach to food. 
J. comp. Psychol., 1934, 17, 393-422. 

. Hur, C. L. Principles of behavior. New 
York: Appleton-Century, 1943. 

. Hurt, C. L. A behavior system. 
Haven: Yale Univer. Press, 1952. 
. Hutz, C. L., Livuncston, J. R., Rouse, 
R. O., & Barker, A. N. True, sham, 
and esophageal feeding as _ reinforce- 
ments. J. comp. physiol. Psychol., 
1951, 44, 236-245. 

. Kacan, J. Differential reward value of 
incomplete and complete sexual behav- 
ior. J. comp. physiol. Psychol., 1955, 
48, 59-64. 

. Koun, M. Satiation of hunger from food 
injected directly into the stomach ver- 
sus food ingested by mouth. J. comp. 
physiol. Psychol., 1951, 44, 412-422. 

. McCrettanno, D. C., Arxinson, J. W., 
CrarkK, R. A. & Lowen, E. L. The 
achievement motive. New York: Ap- 
pleton-Century-Crofts, 1953. 

. Miter, N. E. Learnable drives and re- 
wards. In S. S. Stevens (Ed.), Hanid- 
book of experimental psychology. New 
York: Wiley, 1951. Pp. 435-472. 

. Mriter, N. E., & Kessen, Marton L. Re- 
ward effects of food via stomach fistula 
compared with those of food via mouth. 
J. comp. physiol. Psychol., 1952, 45, 
555-564. 

. Mowrer,O.H. A stimulus-response analy- 
sis of anxiety and its role as a reinforc- 
ing agent. Psychol. Rev., 1939, 46, 553- 
564. 

. Mowrer,O. H. An experimental analogue 
of “regression,” with incidental observa- 
tions on “reaction-formation.” J. ab- 
norm. soc. Psychol., 1940, 35, 56-87. 

. Mowrer, O. H. Two-factor learning the- 
ory reconsidered with special reference 
to secondary reinforcement and the con- 
cept of habit. Psychol. Rev., 1956, 63, 
114-128. 

. Mowrer, O. H., & Lamoreaux, R. R. 
Fear as an intervening variable in avoid- 
ance conditioning. J. comp. Psychol., 
1946, 39, 29-50. 

. Oscoop, C. E. Method and theory in ex- 
perimental psychology. New York: Ox- 
ford Univer. Press, 1953. 


New 


33. 


. Sewarp, J. P. 


REINFORCEMENT 203 


Rousset, Jacguetine S. Frustration ef- 
fect as a function of repeated non-re- 
inforcements and as a function of the 
consistency of reinforcement prior to 
the introduction of non-reinforcement. 
Unpublished master’s thesis, Tulane 
Univer., 1952. 


. Scott, E. M., & Verney, E. L. Self-selec- 


tion of diet. VI. The nature of appe- 
tites for B vitamins. J. Nutrition, 1947, 
34, 471-480. 


. Sewarp, J. P. Secondary reinforcement 


as tertiary motivation: a revision of 
Hull’s revision. Psychol. Rev., 1950, 
57, 362-374. 

Introduction to a theory 
of motivation in learning. Psychol. 


Rev., 1952, 59, 405-413. 


. Sewarp, J. P. How are motives learned? 


Psychol. Rev., 1953, 60, 99-110. 


. Sewarp, J. P., & Peresoom, A. C. Does 


the activity wheel measure goal striv- 
ing? J. comp. physiol. Psychol., 1955, 
48, 272-277. 


. SHEFFIELD, F. D., & Campsett, B. A. The 


role of experience in the “spontaneous” 
activity of hungry rats. J. comp. 
physiol. Psychol., 1954, 47, 97-100. 


. SHEFFIELD, F. D., & Rosy, T. B. Re- 


ward value of a non-nutritive sweet 
taste. J. comp. physiol. Psychol., 1950, 
43, 471-481. 


. Suerrretp, F. D., Rosy, T. B., & Camp- 


BELL, B.A. Drive reduction versus con- 
summatory behavior as determinants of 
reinforcement. J. comp. physiol. Psy- 
chol., 1954, 47, 349-355. 


. Suerrretp, F. D., Wutrr, J. J., & BACKER, 


R. Reward value of copulation with- 
out sex drive reduction. J. comp. 
physiol. Psychol. 1951, 44, 3-8. 


. Taomson, C. W. Need reduction and 


primary reinforcement: A comparison 
of the reinforcing value of sugar and 
saccharine. Paper read at West. Psy- 
chol. Ass., Long Beach, May 20, 1954. 


. Tuaomson, C. W., & Porter, P. B. Need 


reduction and primary reinforcement: 
maze learning by sodium-deprived rats 
for a subthreshold saline reward. J. 
comp. physiol. Psychol., 1953, 46, 281- 
287. 


. THornvike, E. L. Animal intelligence: 


experimental studies. New York: Mac- 
millan, 1911. 


. Younc, P. T. The role of hedonic proc- 


esses in the organization of behavior. 
Psychol. Rev., 1952, 59, 249-262. 


(Received May 12, 1955) 





Psychological Review 
Vol. 63, No. 3, 1956 


INFORMATION-RECEIVING BEHAVIOR OF MAN 
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Hokkaido University 


The recent development of communi- 
cation theory has certainly stimulated 
the corresponding area of psychology, 
and many experiments have been de- 
signed and _ successfully performed, 
mostly for the purpose of ensuring the 
applicability of concepts involved in 
the theory to psychological analysis. 
It is no wonder that the theory con- 
tributes much to the analysis of hu- 
man communicative behavior, since at 
almost any time any person is receiving 
information from his surrounding world 
(perceiving), storing it (memorizing), 
as far as his communicative behavior is 
and sending it (conducting); and thus, 
concerned, he can always be regarded 
as a kind of transducer in a communi- 
cation system—i.e., as a receiver of 
power from one system and a supplier 
of power to a different system. 

A psychologist would be at a loss, 
however, if he wished to go a step fur- 
ther into the detailed complexity of 
actual human communicative behavior 
by directly applying the outcomes of 
the theory, since, though man is surely 
a kind of transducer, he is one of the 
types hitherto never discussed. Present- 
day communication theory has been de- 
veloped mainly to deal with the prob- 
lems of engineering communication, of 
which a distinguished characteristic is 
that it is carried out under rather stable 
boundary conditions, serving a rather 
limited variety of purposes. Thus it is 
a relatively easy task to find the most 
efficient acting transducers when cer- 
tain boundary conditions are given. On 
the other hand, man performs many 
other functions than that of transducer 
and thus, even when communicating, he 
should serve a rich variety of purposes, 


many of which may be entirely irrele- 
vant to efficient transmission. For ex- 
ample, as the receiver, he should not 
only know what the original message is 
but should extract from signals the in- 
formation content which is usually called 
meaning; as the memorizer, he should 
select the information content to be 
memorized according to its degrees of 
importance; and finally, as the sender, 
he will sometimes want to send wrong 
information to deceive his enemy. And 
all of those performances have usually 
to be done under rather unstable con- 
ditions. 

What is unstable in human communi- 
cation is not man himself. There are 
many evidences, mostly physiological, 
which make one suspect that man has 
surprising capabilities in stabilizing him- 
self as a mechanical system. Most of 
the causes of the instability of human 
communication systems are to be looked 
for within the instability of the relations 
between man and his surrounding world. 
Unlike Shannon’s mechanical receiver 
(4), man usually has to deal not with 
a single ensemble but with a large va- 
riety of them simultaneously, most of 
which could possibly be statistically un- 
stable. Further, no proper rules of de- 
coding are given by senders, and they 
have to be discovered by the receiver 
himself; it seems quite improbable that 
there could be any transducer acting 
efficiently enough in such an unstable 
and complex situation. Certainly, there 
exist no mechanical transducers as yet 
which meet the requirement. The only 
possible type of transducer which could 
meet it should at least satisfy the fol- 
lowing two conditions: it should be ca- 
pable by itself of developing adequately 
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efficient rules of decoding separately for 
different information sources; and it 
should be capable of changing the rule 
of decoding as rapidly as possible when- 
ever it receives the information that 
the statistical structure of an ensemble 
seems to have changed. 

In other words, the competent trans- 
ducer should be one that learns; for 
learning roughly means those two func- 
tions, gathering information from the 
surrounding world, and changing the 
performance in a way that suits the en- 
vironment best. No one would deny 
that man is one such type of a system, 
i.e., one that learns, although it is doubt- 
ful if he learns most efficiently as a 
transducer. But, if we take into ac- 
count the severe conditions under which 
he should act as a transducer—the facts 
that he should perform many tasks other 
than communication, that he should act 
below a certain rate of energy expendi- 
ture, and so forth—we ought to con- 
sider him a fairly, though not the most, 
As far as 


efficient type of transducer. 
this is taken for granted, it should be 
possible, though difficult, to find the 
principles or a theory formulating the 
functions of a transducer of this type, 


or of a man as a transducer. In the 
following sections we shall outline what 
considerations are needed for the the- 
ory, restricted mostly to the theory of 
the human receiver, and not of the hu- 
man transducer as a whole. 


CoMMUNICATION AS A GAME 


One of the most distinctive features 
of human communication, not yet dis- 
cussed fully, is that it requires game- 
theoretical considerations. In engineer- 
ing communication, the purposes of 
sender and of receiver are usually in 
perfect congruence; the sender wishes 
to send as much information as possible 
and the receiver wishes to know what 
the original message is as correctly as 
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possible. On the other hand, in actual 
human communication they are not al- 
ways in such accord. Sometimes the 
sender is indifferent to the receiver; 
signals fali only incidentally on the re- 
ceiver, or the sender knows nothing 
about the fact that his emitted signals 
are caught by the receiver. Sometimes 
the sender is hostile to the receiver, and 
he intends to deceive the receiver by 
sending signals which would convey 
wrong information if decoded by con- 
ventional rules. In the latter case they 
are in a game situation. Even in the 
former case, the receiver often has to 
be careful not to be deceived, and has 
to behave as if he were in a game. 
This is clearly illustrated in A. Wald’s 
theory of statistical decision functions 
(8), in which it is formalized that a 
statistician intending to decode signals 
from Nature (apparently an indifferent 
sender of information) should sometimes 
be careful to use the minimax strategy 
in establishing his decoding rules. Ac- 
cordingly, it is clear that game theory 
will play a particularly important role 
in the theory of human communication. 
Von Neumann and Morgenstern’s the- 
ory (7), however, is not directly ap- 
plicable to our purposes. Here what is 
required is a theory that is more spe- 
cifically developed to be able to de- 
scribe various patterns of actual human 
behavior—that takes into account, for 
example, the fact that not all of the 
participants in communication games 
are complete strategists. If a partici- 
pant found that his opponent is no such 
thing, or if he gets the information that 
his opponent’s behavior is in some way 
stereotyped, then to take advantage of 
that fact would usually give him an 
optimal strategy better than the minimax 
strategy. As mentioned above, that some 
sender emits false messages is one of the 
typical features of human communica- 
tion, and in this case the receiver could 
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find certain decoding rules which pro- 
vide him with correct information if the 
sender’s strategy of deceiving is in some 
way stereotyped, because a false mes- 
sage is not a random message and it 
contains information. 

Another point we require of game 
theory is much more serious. The the- 
ory of games assumes that the values of 
objective probabilities in chance moves 
are known to all participants. This, 
however, is scarcely realized in the 
actual situations of games. In poker, 
for example, where the probability of 
each hand is objectively determined, the 
exact values of probability are not 
known to the participants. If one asks 
them what the exact values are, even 
the most skilled players would not be 
able to tell. Nevertheless, the consider- 
able rationality observable in the strate- 
gies of skilled players implies that they 
have some approximate values of prob- 
ability, perhaps subconsciously obtained 
through their experience in the game; 
the necessary information has undoubt- 
edly been given to them from their previ- 
ous experience, i.e., from the time series 
of poker products previously observed. 
In other words, it is contended that all 
the values of probabilities with which 
poker players play are estimated values, 
and the superiority of skilled players is 
due, at least partly, to the fact that the 
numbers of their observed samples ex- 
ceed those of beginners. The fact that 
the estimated values of probabilities 
could be different from participant to 
participant has such grave influence 
upon the properties of the game that 
some strictly determined games could 
no longer be strictly determined, since 
the good strategy of a participant ig- 
norant of the true values is, at best, to 
identify his estimated values with the 
true ones. This is an essentially im- 
portant point in human communication. 


Masanao Topda 


MULTIPLE INFORMATION EXTRACTION 
FROM ONE AND THE SAME SIGNAL 


The meaning of meaning has been 
one of the most tempting subjects in 
human communication theory, and a 
complicated skein of various meanings 
has been attached to the term. Though 
it is certainly difficult to unravel the 
whole skein, a considerable clearing-up 
nevertheless seems possible by the tra- 
ditional way of thinking in the com- 
munication theory. 

Suppose a man hears another’s voice 
making the sounds, “My wife is ill.” 
Then we can say that the signal caught 
by his ear means that “the words just 
spoken are ‘my wife is ill.’” This is 
the only one that may be called mean- 
ing which is dealt with in Shannon’s 
theory, and hereafter will be referred to 
as primary meaning. There certainly 
can be others meant by the same signal, 
for instance, the semantic meaning that 
the speaker’s wife is ill and is neither 
healthy nor dead. It is obvious that 
this semantic meaning as well as the 
primary meaning contains information, 
since it reduces the uncertainty about 
the physical conditions of the speaker’s 
wife, just as the primary meaning re- 
duces the uncertainty about the trans- 
mitted signal. These two kinds of 
meaning are both information contents, 
and hence can be treated in the same 
theoretical framework. The same argu- 
ment holds even when a physical or a 
semantic noise has intervened—e.g., a 
large crashing sound or a strong foreign 
accent of the speaker. The semantic 
meaning in this case is still an informa- 
tion content, though turned by equivo- 
cation into a more vague one such as 
“The speaker’s wife is likely ill.” 

It should be noted that the semantic 
meaning is not the only secondary 
meaning. The words may also mean 
that the speaker is then short of money. 
Very likely this is just what he wished 
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to mean by the words. Or, perhaps, 
the signal might be a sign meaning, en- 
tirely independently of its semantic 
context, say, that it is time to start to 
act. Or, it might mean that the speaker 
is a liar, if the listener incidentally 
knows that the wife is quite well. In 
any case, the meaning is an information 
content, so far as it indicates this or 
that among the whole set of possible 
alternatives. 

Ideas introduced in the above para- 
graphs, with the exception of multiple 
information extraction, are not entirely 
new in communication theory. More- 
over, multiple information extraction it- 
self is really no more than the conse- 
quence of succession of different decod- 
ings. Presumably a considerable part 
of the receiver’s functions and opera- 
tions, which have been discussed by 
various investigators with different ter- 
minologies, could properly be reduced 
into the following two activities: the 
selection of a set of alternative states 
in relation to which information is ana- 
lyzed, and the establishment of a decod- 
ing rule which transforms a particular 
signal to a particular information con- 
tent. 

These considerations suggest that a 
further attack on the problem of mean- 
ing, using the ordinary language of 
communication theory, is possible by re- 
garding meaning as a kind of informa- 
tion content, provided that an appropri- 
ate measure of the amount of informa- 
tion can be assigned to the contents. 


AMOUNT OF INFORMATION AND 
SUBJECTIVE PROBABILITY 


When a psychologist’s interest is fo- 
cused on the human receiver, he should 
remember that the Shannon-Weaver 
measure of the amount of information 
is the measure prepared for the out- 
sider—neither receiver nor sender— 
who is watching and controlling the 
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communication system, and whose only 
interest is the mean rate of informa- 
tion transmission, namely, — f;plogopi, 
where f is the signal frequency. Such 
a value prepared for or measured by 
the outside controller is, as a matter 
of fact, entirely irrelevant to the actual 
human receiver, who cannot usually 
control at will the sender’s rate of in- 
formation transmission. Though natu- 
rally the receiver will hope that the 
rate of information transmission will in- 
crease, for him the most meaningful 
measure of the amount of information 
must be the amount of information in- 
take, which he can vary by changing 
his decoding rules. The receiver’s meas- 
ure of the amount of information is thus 
to be the measure of extracted informa- 
tion content, primary or secondary. It 
is further required that this measure be 
defined for the individual content and 
not for the ensemble of contents. In 
actual human communications the re- 
ceiver cannot usually get signals fre- 


quently enough from the same ensemble 
to be able to estimate the mean rate of 
information transmission (or intake) 


within the time available. If he waited 
too long, the improvement of his decod- 
ing rules would be practically ineffec- 
tive, or more likely, the statistical prop- 
erties of the ensemble might change 
during the time, even though they were 
once stable. 

Such a definition also will be called 
for by an economic point of view. As 
Marschak states (1), information con- 
tents can be regarded as commodities 
that can be sold and bought at a certain 
price. Most information contents are 
traded in singly and not as an ensemble, 
and, though the utility of information 
is certainly a more complex notion than 
the amount of information, it is natural 
to suppose that a higher price somehow 
corresponds to a larger amount of in- 
formation among the same kind of in- 
formation content. 
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At first sight, to define such a meas- 
ure may seem quite simple. If we let p 
be the probability of getting a particular 
information content, then the amount of 
information of the content is — logsp 
bits. It is obvious that this measure 
for single content is compatible with 
Shannon and Weaver’s measure for en- 
semble. Practically, however, this defi- 
nition is meaningless. Who among ordi- 
nary human receivers knows of the 
objective statistical properties of en- 
sembles? As mentioned previously, we 
should drop the assumption that all the 
true probabilities are known by receiv- 
ers. The values are estimated, and not 
given from the start. Accordingly, the 
above definition should be altered, and 
though the alteration is slight, it would 
have a far-reaching influence on the con- 
cept of information. Let q be the esti- 
mated value of the probability of get- 
ting an information content. Then the 
amount of information of the content 
for the receiver is defined as — logeq 
bits. Remember that this is the defi- 
nition of amount of information and not 
that of estimated amount of informa- 
tion. I am convinced that this is the 
only reasonable measure for our pur- 
poses, and might be, in a sense, a more 
fundamental measure than Shannon and 
Weaver’s. The following paragraphs are 
of central importance to this point. 

Consider a simple true-or-false prob- 
lem which a person does not know how 
to answer. Its correct answer certainly 
contains a finite amount of information, 
since by knowing the answer his uncer- 
tainty vanishes and he knows more than 
he did before. The value of the amount 
of information is either —Jlogog or 
— logo(1 — q), depending on whether 
the correct answer is “true” or “false,” 
where gq is his estimated probability that 
the answer will turn out to be “true.” 
What is the objective probability in 
this case? If it is to be one or zero, 
such an answer could contain no infor- 
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mation in the classical sense. This diffi- 
culty cannot be avoided by taking as 
the objective probability the frequency 
that the answer has been “true” in past 
similar situations. Though his value of 
probability qg might possibly be deter- 
mined in a somewhat similar manner, 
no one could find any mathematically 
rigorous probability of such type, or any 
statistically stationary ensemble of simi- 
lar situations. The simple true-or-false 
problem can be replaced by the true-or- 
false problem of any unproved mathe- 
matical statement. As Brouwer once 
pointed out, an unproved mathematical 
statement is in a state somewhere be- 
tween true and false. But who knows 
or can determine the objective prob- 
ability that some unproved mathemati- 
cal statement will turn out to be true? 
Nevertheless, no one would deny that 
its proof (whether affirming or negat- 
ing) contains information, and we have 
to measure its amount. 

Perhaps the reader will admit the 
well-known definition that information 
is something which reduces the receiv- 
er’s uncertainty. Then he would also 
have to admit the possibility that a 
message brings different amounts of in- 
formation to different receivers, since 
the values of uncertainties could be 
quite diverse among actual human re- 
ceivers. For instance, if there are two 
receivers, one of whom knows of some- 
thing in its every detail with no remnant 
of uncertainty and the other has very 
poor knowledge about it and thus has 
very great uncertainty, it is easily seen 
that a message on the subject will give 
the former nothing and the latter a large 
amount of information. These facts 


clearly agree with our definition of 
amount of information, in which mathe- 
matically true probabilities are replaced 
by estimated probabilities or subjective 
probabilities. Since subjective probabil- 
ity is measurable by some experimental 
techniques which adequately separate it 
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from utility (5), the definition stands 
on a definite operational ground. 


THE OBSERVAND AND Its CoMPLETE 
Set oF STATES 


As explained in the preceding section, 
a human receiver may derive from the 
same message several information con- 
tents, each referring to a different sub- 
ject, such as, “the words just spoken,” 
“the physical condition of a man’s wife,” 
or “his monetary condition.” In order 
to avoid needless ambiguities we shall 
call the subject of an information con- 
tent the observand, and any informa- 
tion content will be written in the form 
“An observand X is in a state x;,” or, 
in the above example, “The wife’s physi- 
cal condition” is in the state “ill.” Usu- 
ally there is more than one state of an 
observand forming a space to which a 
subjective probability measure is as- 
signed. Roughly speaking, the set of 
states is complete when it is exhaustive 
and the states are mutually exclusive. 
“Healthy,” “ill,” and “dead” offer an 
example of a complete set for the ob- 
servand “one’s physical condition.” Of 
course, there can be other complete sets 
for the same observand; one’s physical 
condition could be classified by com- 
pletely different and independent cri- 
teria. It seems proper, however, to re- 
gard these independent sets as belong- 
ing to different observands, since infor- 
mation contents described by each of 
them can also be independent of each 
other. 

Given any complete set of states, it is 
possible to create a new complete set 
that is a relative of the first by simply 
combining some of the states, or, con- 
versely, by dividing some of the states 
in the original set. Which one of these 
related states the receiver decides to 
use is of great importance, since both 
the information content and the meas- 
ure of the amount of information de- 
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pend on the selection. It is evident that 
the amount of information will increase 
as the states are subdivided, but there 
are limits beyond which further divi- 
sions are futile. One obvious limit on 
how far the receiver should go in dif- 
ferentiating states is reached when dif- 
ferentiation does not produce a more 
accurate identification of the observand. 
If a receiver used one of the overdiffer- 
entiated sets, he could obtain only a 
type of information content such as 
“The observand is in a particular subset 
including more than one state”; and as 
to any further precise location of the 
observand’s position among the states 
in the subset no information could be 
drawn from the signal. Another re- 
striction may be a physiological one. 
It is most probable that some principle 
of economy or efficiency affects the se- 
lection of the set, since it is likely that 
the smaller the number of alternative 
states the less the energy, the smaller 
the number of necessary physiological 
elements, and the less the probability 
of error in analyzing and memorizing 
information. Further, we must con- 
sider social influences on the selections 
of the set. In a social group, whose 
members have assembled to cooperate, 
communications should be performed 
most efficiently, and it is quite natural 
that common sets shared by the group 
members are established for the ob- 
servands which are significant for the 
group activity. Perhaps the social-psy- 
chological concepts of group norm and 
shared frame of reference are relevant 
to such common sets (2). 


GENERALIZATION OF DEFINITIONS 


We have already described two types 
of information content, “X is in x;” and 
“X is in S;,” where X, x, and S; signify 
an observand, its i-th state and its i-th 
subset of states, respectively. More 


precisely, however, these contents should 
be expressed as “X is in x (or in S;) 
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with probability one.” Once expressed 
in such a form, it is obvious that they 
are special types of information con- 
tent. (It should be particularly noted 
that there is no necessity for coincidence 
of the forms of verbal message and the 
information content extracted from it. 
At least one reason for the fact that 
verbal messages are apt to take the form 
of absolute judgments may be found in 
their relative immunity from noise at- 
tack.) Thus, an information content 
will more generally be expressed in such 
form as “X is in S;, S;, Sy, . . . with 
probabilities ~;, ~;, px, ... , respec- 
tively,” where S’s are subsets of states. 
The probabilities are of course subjec- 
tive probabilities estimated from the 
receiver’s past experience, which might 
have told him that, when he obtained 
such signals from such an information 
source in such a situation, the observand 
was in such and such states with such 
and such frequencies; or perhaps it 
might have told him more. Then we 
can formulate: An information content 
is a subjective probability measure as- 
signed to a complete set whose elements 
are mutually disjunct subsets of the 
original complete set with which the in- 
formation is analyzed. If any one of 
the subsets contains more than one 
state, the information content will be 
said to be degenerating in that subset. 

Now we have to generalize the defi- 
nition of amount of information ac- 
cording to the generalization of the 
definition of information content. This 
is a rather simple task when it is ab- 
solutely certain that the probabilities 
given by the obtained information con- 
tent are the objective probabilities. Let 
the given objective probabilities and the 
existent subjective probabilities be 1, 
po, . . . and qi, qo, . . . , respectively. 
Then if it is assumed to be known that 
X is really in x, the amount of informa- 
tion obtained is — logeq; and — logop; 
before and after the objective probabili- 
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ties are given, respectively. Accord- 
ingly, the mathematical expectations of 
amount of information are — 3;plogoq: 
and — 3,pjog p; before and after the 
probabilities are given, respectively. It 
is easily verified that the former is not 
less than the latter. Hence the expecta- 
tion of amount of information given by 
the information of objective probabili- 
ties is positive and equal to — 3,plogoq 
+ X;pilogep;. We shall call this quan- 
tity the information gain, and it coin- 
cides with the definition of amount of 
information previously offered when the 
given probabilities are one and zero. 
Even when it is not absolutely certain 
that the given probabilities are objec- 
tive, we would use the same equation to 
estimate the information, as long as we 
are convinced that the given probability 
distribution is far more authentic than 
the original subjective probability dis- 
tribution. Difficult problems arise when 
neither one of the distributions has such 
a predominant authenticity. This prob- 
lem, however, has to be treated from 
a more general point of view. 


SYNTHESIS OF INFORMATION 


It is a common experience to receive 
from different information sources vari- 
ous information contents about the same 
observand which more or less contradict 
each other; this leaves one uncertain as 
to what is the most reliable information 
content. This is the problem of syn- 
thesis of information contents, which is 
important not only for individuals but 
also for social organizations; and our 
lines of thought seem to provide its ra- 
tional solution. Because of the consid- 
erable complexity of the problem, how- 
ever, only a brief outline of possible 
solutions will be shown here, of which 
the more detailed discussions are found 
in another paper by the author (6). 

Suppose that there are m — 1 number 
of different information contents given 
to a receiver. Then there are m num- 
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ber of different probability distributions, 
including his own subjective probability 
distribution possessed in advance. Usu- 
ally each distribution is different in its 
reliability, according to its source and 
the situation in which it is given. One 
quite simple way of obtaining the most 
reliable probability distribution is to 
compute weighted means of probabili- 
ties for each state, where each weight 
represents the reliability of the corre- 
sponding information content. This idea 
of simple linear combination can be 
made more convincing by the follow- 
ing sample case: It is known that some 
one of the alternative distributions is 
the objective probability distribution, 
and the probability that the i-th dis- 
tribution D; is objective is W;, (i = 7, 
2,..., m). Let py denote the prob- 
ability given by D, for j-th state (or 
subset), and let 7; denote the most re- 
liable probability to be obtained, (i = 1, 
: ,mandj=1,...,n; n is the 
number of alternative states or subsets). 
If it happens that D, is true, then the 
expected amount of information that he 
loses by adopting the distribution R in- 
stead of D; is — S;Pilogopi; + S;Pylogor;. 
We shall call this quantity the informa- 
tion loss, which has just the same form 
as that of the information gain, but with 
the negative sign, and is never posi- 
tive. Then the expected information 
loss is 3,W4(— S;pilogopis + X;Pilogor;). 
This value is a function of R, and it is 
easily proved that this function takes its 
minimum absolute value when R is the 
weighted mean where the weights are 
the probabilities W,. 

This is only a sample case in which 
the implication of the method is quite 
clear. Ina more general case the weights 
need not necessarily be comprehended 
as probabilities. Perhaps the statement 
that the weighted mean solution mini- 
mizes the weighted mean of information 
loss has value in its own right. This 
solution will be called the minimum in- 
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formation-loss estimate of probability 
distribution. 

There is another kind of evidence 
justifying this method. In the above 
discussion it has been tacitly assumed 
that none of the information contents 
degenerates, or, if they do, that they 
all degenerate in the same subsets. Of 
course this cannot be assumed generally. 
If some of the information contents de- 
generate differently, it is necessary for 
linear combination that the probability 
assigned to a subset be divided into 
each of its constituent states. It is 
proved that the division proportional to 
the distribution of r; in the subset gives 
the minimum information-loss estimate 
by solving the resultant system of equa- 
tions. This is a completely rational 
solution, because, in the case of an en- 
tirely degenerated information content 
assigning probability one to the whole 
set, and therefore meaning nothing, the 
only possible division which does not 
change the solution by adding this nomi- 
nal content is the proportional division. 

It should be noted, however, that this 
method is applicable only to a special 
kind of information-content ensemble— 
of which all the contents are possibly 
more or less false. Another extreme 
case will be found in the ensemble of 
which contents are more or less incom- 
plete but not false. For this ensemble 
the solution is obtained by summing 
— log ps with respect to # for each j 
(a kind of additivity of amount of in- 
formation holds in this case!); the re- 
sultant probability is obtained from 
II,fi; by normalization. As in the previ- 
ous case, we can define information-loss 
function as — Syogor; + Srloge(Mipi;/ 
S,;Il;py), and it is easily seen that r; = 
Il,piy/;Mipy gives this nonpositive func- 
tion its maximum value zero. 

An experiment conducted by the au- 
thor to detect what rule of information 
synthesis is adopted by undergraduate 
students in the latter case revealed a 
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rather surprising predictability by the 
above formula for group means. Un- 
doubtedly, this problem of information 
synthesis deserves the attention not only 
of experimental psychologists and infor- 
mation theorists but also of many other 
people who are engaging in the synthe- 
sis of information, since obviously they 
have no rule but intuition for mak- 
ing their syntheses, except when some 
appropriate statistical techniques are 
available. 


SUMMARY 


Communication theory as developed 
mainly for engineering purposes is in- 
complete as a theory of actual human 
communication. The required theory 
should pay proper attention to the fol- 
lowing facts: (2) Human communica- 
tion is a kind of game, sometimes co- 
operative and sometimes competitive. 
(6) The receiver’s rules of decoding are 
usually not given; they should be de- 


veloped by the receiver himself for each 


different information source. (c) The 
receiver may extract more than one in- 
formation content from a signal. (d) 
The receiver is usually not given objec- 
tive probabilities, but he estimates them. 
Accordingly, the amount of information 
and information content should be de- 
fined with respect to subjective prob- 
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abilities (5). Particular attention should 
be paid to the problem of information 
synthesis. Brief consideration has been 
given to all these topics. 
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